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Abstract 

Leader election is one of the fundamental problems in distributed computing. It calls for all 
nodes of a network to agree on a single node, called the leader. If the nodes of the network have 
distinct labels, then agreeing on a single node means that all nodes have to output the label 
of the elected leader. If the nodes of the network are anonymous, the task of leader election is 
formulated as follows: every node v of the network must output a simple path, which is coded 
as a sequence of port numbers, such that all these paths end at a common node, the leader. In 
this paper, we study deterministic leader election in anonymous trees. 

Our aim is to establish tradeoffs between the allocated time r and the amount of information 
that has to be given a priori to the nodes to enable leader election in time r in all trees for 
which leader election in this time is at all possible. Following the framework of algorithms with 
advice, this information (a single binary string) is provided to all nodes at the start by an oracle 
knowing the entire tree. The length of this string is called the size of advice. For a given time 
T allocated to leader election, we give upper and lower bounds on the minimum size of advice 
sufficient to perform leader election in time r. 

For most values of r, our upper and lower bounds are either tight up to multiplicative 
constants, or they differ only by a logarithmic factor. Let T be an n-node tree of diameter 
diam < D. While leader election in time diam can be performed without any advice, for time 
diam — 1 we give tight upper and lower bounds of 0(log 13). For time diam — 2 we give tight 
upper and lower bounds of 0(logZ3) for even values of diam, and tight upper and lower bounds 
of 0(logn) for odd values of diam. Moving to shorter time, in the interval [j3 ■ diam, diam — 3] 
for constant /3 > 1/2, we prove an upper bound of ” ) and a lower bound of the 

latter being valid whenever diam is odd or when the time is at most diam — 4. Hence, with the 
exception of the special case when diam is even and time is exactly diam — 3, our bounds leave 
only a logarithmic gap in this time interval. Finally, for time a ■ diam for any constant a < 1/2 
(except for the case of very small diameters), we again give tight upper and lower bounds, this 
time 0(n). 
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1 Introduction 


Background. Leader election is one of the fundamental problems in distributed computing [33]. 
It calls for all nodes of a network to agree on a single node, called the leader. This classic problem 
was first formulated in |32| in the study of local area token ring networks, where, at all times, 
exactly one node (the owner of a circulating token) has the right to initiate communication. When 
the token is accidentally lost, a leader is elected as the initial owner of the token. 

If the nodes of the network have distinct labels, then agreeing on a single node means that all 
nodes have to output the label of the elected leader. However, in many applications, even if nodes 
have distinct identities, they may be reluctant to reveal them, e.g., for privacy or security reasons. 
Hence it is important to design leader election algorithms that do not depend on the knowledge 
of such labels and that can work in anonymous networks as well. Under this scenario, agreeing 
on a single node means that every node has to output a simple path (coded as a sequence of port 
numbers) to a common node. 

Model and Problem Description. The network is modeled as an undirected connected graph 
with n nodes and with diameter diam at most D. In this paper, we restrict attention to tree 
networks, i.e., connected networks without cycles. We denote by diam{T) the diameter of tree T. 
Nodes do not have any identifiers. On the other hand, we assume that, at each node v, each edge 
incident to v has a distinct port number from {0,...,(i—1}, where d is the degree of v. Hence each 
edge has two corresponding port numbers, one at each of its endpoints. Port numbering is local to 
each node, i.e., there is no relation between port numbers at the two endpoints of an edge. Initially 
each node knows only its own degree. The task of leader election is formulated as follows. Every 
node V of the tree must output a sequence P{v) = {pi,... ,pk) of nonnegative integers. For each 
node V, let P*{v) be the path starting at v that results from taking the number pi from P{v) as the 
outgoing port at the node of the path. All paths P*{v) must be simple paths in the tree that 
end at a common node, called the leader. In this paper, we consider deterministic leader election 
algorithms. 

Note that, in the absence of port numbers, there would be no way to identify the elected leader by 
non-leaders, as all ports, and hence all neighbours, would be indistinguishable to a node. Security 
and privacy reasons for not revealing node identifiers are irrelevant in the case of port numbers. 

We use the extensively studied HOC AC communication model I37|. In this model, communication 
proceeds in synchronous rounds and all nodes start simultaneously. In each round, each node can 
exchange arbitrary messages with all of its neighbours and perform arbitrary local computations. 
For any tree T, any r > 0 and any node x in T, we use VT{x,r) to denote the view acquired in T 
by X within r communication rounds. This is all the information that v gets about the tree T in r 
rounds. Thus, the view VT{x,r) in T consists of the subtree of T induced by all nodes at distance 
at most r from x, together with all the port numbers at these nodes, and with the degrees of all 
nodes at distance exactly r from x. If no additional knowledge is provided a priori to the nodes, 
the decisions of x in round r in any deterministic algorithm are a function of VT{x,r). In most 
cases, a node’s view is considered in the underlying tree in which leader election is being solved, 
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and then the subscript T is omitted. The time of leader election is the minimum number of rounds 
sufficient to complete it by all nodes. 

It is well known that the synchronous process of the COCAC model can be simulated in an asyn¬ 
chronous network. This can be achieved by dehning for each node separately its asynchronous 
round i; in this round, a node performs local computations, then sends messages stamped i to all 
neighbours, and waits until it gets messages stamped i from all neighbours. To make this work, 
every node is required to send at least one (possibly empty) message with each stamp, until ter¬ 
mination. All of our results can be translated for asynchronous networks by replacing “time of 
completing a task” by “the maximum number of asynchronous rounds to complete it, taken over 
all nodes”. 

For anonymous trees, the task of leader election is not always feasible, regardless of the allocated 
time. This is the case when the tree is symmetric, i.e., when there exists a non-trivial port¬ 
preserving automorphism of it. Such an automorphism is defined as a bijection / : A —)• A, where 
A is the set of nodes, such that {x, y} is an edge with port numbers p at x and g at y if and 
only if {f{x),f{y)} is an edge with port numbers p at f{x) and q at f{y). It is easy to see that 
leader election is possible in a tree only if the tree is not symmetric. Symmetric trees are easy to 
characterize. Indeed, every tree has a centre which is either a node or an edge defined as follows. If 
the diameter diam is even, then the central node is the unique node in the middle of every simple 
path of length diam, and if the diameter diam is odd, then the central edge is the unique edge in 
the middle of every simple path of length diam. A tree is symmetric if and only if diam is odd, 
ports at the central edge are equal, and the two subtrees resulting from the deletion of the central 
edge are (port-preserving) isomorphic. For symmetric trees, the only non-trivial automorphism is 
the one switching the corresponding nodes of these subtrees, and this prevents leader election. 

Moreover, even in non-symmetric trees, leader election may be impossible if the allocated time is 
too short. Consider the line of length 6 with port numbers 0,0,1,1,0,0,1,1,0,1,0,0 (from left to right). 
If the allocated time is 1, then leader election is impossible even if nodes know a priori the entire 
map of the line. Indeed, neither of the two leaves knows whether it is the left or the right leaf and 
cannot learn this fact in time 1, and thus, leaves cannot output correct simple paths to a common 
node (the formal proof is slightly more complicated). Hence, for any non-symmetric tree T, it is 
important to introduce the parameter ^(T) defined as the minimum time in which leader election 
is feasible, assuming that each node is given the entire map of T with all port numbers faithfully 
mapped (but without the position of the node marked in the map). For the line T in the above 
example, ^(T) = 2. 

Our aim is to establish tradeoffs between the allocated time and the amount of information that 
has to be given a priori to the nodes to enable them to perform leader election. Following the 
framework of algorithms with advice, see, e.g., [iniiiaiisiiiaisiiiiTiEe], this information (a single 
binary string) is provided to all nodes at the start by an oracle knowing the entire tree. The length 
of this string is called the size of advice. Of course, since the faithful map of the tree is the total 
information about it, asking about the minimum size of advice to solve leader election in time r is 
meaningful only in the class of trees T for which f,{T) < r, becanse otherwise, no advice can help. 
In light of these remarks, we are able to precisely formulate the central problem of this paper. 
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For a given time r, what is the minimum size of advice that permits leader election in 
time r for all trees T where ^{T) < r? 

The paradigm of algorithms with advice has a far-reaching significance in the domain of network 
algorithms. Establishing a tight bound on the minimum size of advice sufficient to accomplish a 
given task permits to rule out entire classes of algorithms and thus focus only on possible candidates. 
For example, if we prove that 0(logn) bits of advice are needed to perform a certain task in n- 
node trees, this rules out all potential algorithms that can work using only the diameter diam of 
the tree, as diam can be given to the nodes using 0(log(dmm)) bits, and the diameter can be, 
e.g., logarithmic in the size of the tree. Lower bounds on the size of advice give us impossibility 
results based strictly on the amount of initial knowledge outlined in a model’s description. This 
more general approach should be contrasted with traditional results that focus on specific kinds of 
information available to nodes, such as the size, diameter, or maximum node degree. 

Our results. Let T be an n-node tree of diameter diam < D. For a given time r allocated to leader 
election, we give upper and lower bounds on the minimum size of advice sufficient to perform leader 
election in time r. An upper bound U means that, for all trees T with ^(T) < r, leader election in 
time r is possible given advice of size 0{U). We prove such a bound by constructing advice of size 
0{U) together with a leader election algorithm for all trees T with ^(T) < r that uses this advice 
and works in time r. A lower bound L means that there exist trees T with ^(T) < r for which 
leader election in time r requires advice of size n(L). Proving such a bound means constructing a 
class consisting of trees T with ^(T) < r for which no leader election algorithm running in time r 
with advice of size o{L) can succeed. 

For most values of r, our upper and lower bounds are either tight up to multiplicative constants, 
or they differ only by a logarithmic factor. More precisely, these bounds are the following. While 
leader election in time diam can be performed without any advice, for time diam — 1 we give tight 
upper and lower bounds of 0(logL)). For time diam — 2, we give tight upper and lower bounds of 
0(logi2) for even values of diam and tight upper and lower bounds of 0(logn) for odd values of 
diam. Moving to shorter time, in the interval [/3 • diam, diam — 3] for constant /? > 1/2, we prove 
an upper bound of and a lower bound of the latter valid whenever diam is odd or 

time is at most diam — 4. Hence, with the exception of the special case when diam is even and 
time is exactly diam —3, our bounds leave only a logarithmic gap in this time interval. (See section 
7 for a discussion of this special case.) Finally, for time a • diam for any constant a < 1/2 (except 
for the case of very small diameters, namely for diam G w(log^n)) we again give tight upper and 
lower bounds, this time 0(n). The above results are summarized in Figure 

Our results show that the minimum size of advice sufficient to perform leader election is very 
sensitive to the amount of time allocated to this task, and that this sensitivity occurs at different 
time values depending on the relation between the diameter and the size of the tree. If diam is odd 
and small compared to n, e.g., diam G O(logn), then a difference of one round (between diam — 1 
and diam — 2) causes an exponential jump of the size of information required for leader election, 
and anof/ier exponential jump occurs in this case between time diam —2 and diam —3. By contrast, 
for larger diameter, e.g., diam G Q{^/n), the first exponential jump disappears but the second still 
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Figure 1: Tradeoffs between time and size of advice in n-node trees with diameter diam < D 

holds. On the other hand, perhaps surprisingly, an exponential jump occurs at fixed time diam — 2 
when the diameter is small (e.g., logarithmic in n), depending only on the parity of the diameter. 

Related work. The leader election problem was introduced in [32]. This problem was first 
extensively studied in the scenario where all nodes have distinct labels. Initially, it was investigated 
for rings. A synchronous algorithm based on label comparisons and using 0(n log n) messages was 
given in |26] . In |19j it was proved that this complexity is optimal for comparison-based algorithms. 
On the other hand, the authors showed an algorithm using a linear number of messages but requiring 
very large running time. An asynchronous algorithm using O(relogn) messages was given, e.g., in 
|38j , and the optimality of this message complexity was shown in [8| . Deterministic leader election 
in radio networks has been studied, e.g., in [28l EU [35| , as well as randomized leader election, e.g., 
in |l0|. In [25|, the leader election problem was approached in a model based on mobile agents for 
networks with labeled nodes. 

Many authors O HlEl [H [T] SU 02] studied leader election in anonymous networks. In particular, 
|6lll2| characterize message-passing networks in which leader election can be achieved when nodes 
are anonymous. In |3T], the authors study the problem of leader election in general networks under 
the assumption that node labels are not unique. They characterize networks in which this can 
be done and give an algorithm which performs election when it is feasible. In |14) . the authors 
study feasibility and message complexity of leader election in rings with possibly nonunique labels, 
while, in [T2|, the authors provide algorithms for a generalized leader election problem in rings 
with arbitrary labels, unknown (and arbitrary) size of the ring, and for both synchronous and 
asynchronous communication. Memory needed for leader election in unlabeled networks was studied 
in |22|. In [2T], the authors investigated the time of leader election in anonymous networks by 
characterizing this time in terms of the network size, the diameter of the network, and an additional 
parameter called the level of symmetry, which measures how deeply nodes have to inspect the 
network in order to notice differences in their views of it. In m , the authors studied the feasibility 
of leader election among anonymous agents that navigate in a network in an asynchronous way. 
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Providing nodes or agents with arbitrary kinds of information that can be used to perform network 
tasks more efficiently has previously been proposed in [U HOl [131 [13 113 EZl [13 llll [23l [Ml [271 
[291133 133 133 [39]. This approach was referred to as algorithms with advice. The advice is given 
either to the nodes of the network or to mobile agents performing some network task. In the first 
case, instead of advice, the term informative labeling schemes is sometimes used if (unlike in our 
scenario) different nodes can get different information. 

Several authors studied the minimum size of advice required to solve network problems in an 
efficient way. In [33, given a distributed representation of a solution for a problem, the authors 
investigated the number of bits of communication needed to verify the legality of the represented 
solution. In [T3 , the authors compared the minimum size of advice required to solve two information 
dissemination problems using a linear number of messages. In [13 , it was shown that advice of 
constant size given to the nodes enables the distributed construction of a minimum spanning tree 
in logarithmic time. In [T3, the advice paradigm was used for online problems. In II3, the authors 
established lower bounds on the size of advice needed to beat time 0(log* n) for 3-coloring cycles 
and to achieve time 0(log* n) for 3-coloring unoriented trees. In the case of [33, the issue was not 
efficiency but feasibility: it was shown that 0(nlogn) is the minimum size of advice required to 
perform monotone connected graph clearing. In [27], the authors studied radio networks for which 
it is possible to perform centralized broadcasting in constant time. They proved that constant time 
is achievable with 0{n) bits of advice in such networks, while o{n) bits are not enough. In [23, the 
authors studied the problem of topology recognition with advice given to the nodes. In 13, the 
task of drawing an isomorphic map by an agent in a graph was considered, and the problem was to 
determine the minimum advice that has to be given to the agent for the task to be feasible. To the 
best of our knowledge, the problem of leader election with advice has never been studied before for 
anonymous networks. In |34j . the authors investigated the minimum size of advice sufficient to find 
the largest-labelled node in a graph. The main difference between [3T| and the present paper is that 
we consider networks withont node labels. This is not a small difference: from the methodological 
perspective, breaking symmetry in anonymous networks relies heavily on the structure of the graph, 
and, as far as results are concerned, much more advice is needed. 


2 Terminology and preliminaries 


In this paper we use the word path to mean a simple path in the tree. For nodes a and b, we denote 
by d{a, b) the distance from a to b, and by path{a, b) the path (a,..., 6). Nodes a and b are called 
the endpoints of this path. Let 6 be a node in path{a, c). We say that path{a, c) is the concatenation 
of path{a, b) and path{b, c) and we write path{a, c) = path{a, b) ■ path{b, c). The length of a path 
P, denoted by |P|, is the number of edges in it. Denote by seq{a, b) = {pi,... ,ps) the sequence of 
all ports encountered when moving from a to 6 on path{a,b). 0dd-indexed terms in seq{a,b) are 
called the outgoing ports of seq{a, b). We also use the operator • to denote the usual concatenation 
of sequences of integers, e.g., when concatenating two port sequences. 

Let n be a node of a tree T and let r be a non-negative integer. An endless path in V{v,r) is a 
simple path of length r, with endpoints v and v', such that v' is not a leaf in T. A terminated path 
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in V{v, r) is a simple path of length at most r, with endpoints v and u', such that v' is a leaf in T. 
See Figure for examples of terminated and endless paths. 
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r nodes 


Figure 2: An example of a node v's view up to distance r, i.e. V{v,r). The path between v and v' 
is a terminated path in V{v,r), while the path between v and v" is an endless path in V{v,r). All 
endless paths in V{v,r) starting at v pass through node v". 


3 Time diam — 1 

In this section, we show tight upper and lower bounds of 0(logZI) on the minimum size of advice 
sufficient to perform leader election in time diam — 1 in trees of diameter diam < D. The upper 
bound 0(log D) is straightforward: given the value of diam, every node v can reconstruct the entire 
tree from V{v, diam — 1) as follows. For each endless path with endpoints v, v' , where v' has some 
degree k, node v attaches k — 1 leaves to v'. Hence, by using any centralized algorithm on the entire 
tree, all nodes can perform leader election whenever the tree is not symmetric. (This also shows 
that every non-symmetric tree T has ^(T) < diam — 1.) The matching lower bound is given by the 
following theorem. 

Theorem 3.1 Consider any algorithm ELECT such that, for every non-symmetric tree T, algo¬ 
rithm ELECT solves election within diam{T) — 1 rounds. For every integer D > 3, there exists a 
tree T with diameter at most D and f,{T) < diam{T) — 1, for which algorithm ELECT requires 
advice of size H(logT)). 

Proof. Fix any integer D >3. We will show a stronger statement: at least D — 1 different advice 
strings are needed in order to solve election within diam{T) — 1 rounds for some trees T with 
diameter at most D and fiT) < diam{T) — 2. To prove this statement, we first construct a class 
of trees T = {T 2 ,..., Td}, where tree Tk is the path of length k. For each k G {2,, D}, let Ok 
and bk be the endpoints of T^, and label the ports of Tk such that the port sequence seq{ak, bk) is 
equal to (0, 0,1, 0,1,0,..., 1,0). See Figure for an illustration of Tk. We will denote by Pa^. and 
Phj. the sequences of outgoing ports that are outputted by Ofc and bk, respectively, at the end of 
the execution of algorithm ELECT in tree Tk. Note that algorithm ELECT is correct only if, for 
every k G {2,... ,D}, there exists a node ik £ Tk such that sequence Pa^. corresponds to a simple 
path from Ok to ik, and sequence Pb^, corresponds to a simple path from bk to ik- Hence, algorithm 
ELECT is correct only if |Paj.| + \Pbd — ^ each k G {2,..., D}. 
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Figure 3: Example of trees constructed in Theorem 3.1 for k = 5, 6. 


Next, to obtain a contradiction, assume that D — 2 different advice strings are sufficient to solve 
election within diam{T) — 1 rounds for each tree T in T. By the Pigeonhole Principle, there exist 
i,j G {2,..., D} with i < j such that the same advice string is provided to the nodes of both Ti 
and Tj when they execute algorithm ELECT. When executed at node o* in Ti, algorithm ELECT 
halts in some round ra < diam{Ti) — 1 = i — 1 and outputs some port sequence Pa^- Similarly, when 
executed at node bi in T,, algorithm ELECT halts in some round Vf, < i — 1 and outputs some 
port sequence Pb^. As noted above, we have that jPaJ + |AJ = i- We show that, when executed 
at node aj in Tj, algorithm ELECT also halts in round and outputs Pa^. Indeed, the algorithm 
is provided with the same advice string for both Tj and Tj, and Vr^iai, ra) = Vr^ (oj, Tq). Similarly, 
when executed at node bj in Tj, algorithm ELECT halts in round rb and outputs Pb^■ However, 
this implies that, in the execution of ELECT in tree Tj, we have \Paj \ + \Pbj I = \Pai I +1AJ = i <3-, 
which contradicts the correctness of ELECT. 

We finally show that, for every k G {2,... ,P}, we have ^(T^) < k — 2. First assume that k > 2. 
For every node v of T^, at least one of the endpoints of is in V{v,k — 2). Hence v can identify 
its position in the map of T^ and output the sequence of outgoing ports leading from u to a^. For 
k = 2, both leaves output the sequence (0) and the central node outputs the empty sequence. □ 


4 Time diam — 2 

In this section, we show tight upper and lower bounds on the minimum size of advice sufficient to 
perform leader election in time diam — 2, for trees of diameter diam < D. These bounds depend 
on the parity of diam. They are 0(logP) for even values of diam, and 0(logn) for odd values of 
diam. We consider these two cases separately. 


4.1 Even Diameter 


Consider any tree T with n nodes and even diameter diam < D. The lower bound Q{\ogD) on the 
minimum size of advice sufficient to perform leader election in time diam — 2 can be proven exactly 


as Theorem 3.1 We now prove the matching upper bound by providing an algorithm EvenElect 


that solves election in time diam — 2 using 0(log D) bits of advice. The algorithm works by having 
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each node find and elect the central node of the tree, which we denote by Vc- The advice provided 
to the algorithm is the value of diam. Let h = diam/2. The gateway of a node v is defined as 
the node in V{v, diam — 2) furthest from v such that every endless path in V{v, diam — 2) passes 
through gv We now give the pseudo-code of the algorithm executed at an arbitrary node v in tree 
T using advice A. 


Algorithm 1 EvenElect(A) 

1 : Vq ^— 0 

2: diam ^ diameter of T, as provided in A 
3: /i diam/2 

4: Use diam — 2 rounds of communication to learn V(v, diam — 2) 

5: If V{v, diam — 2) contains no endless paths starting at v: 

6: Vc ^ central node of V{v, diam — 2) 

7: Else; 

8: gv the node w in V{v,diam — 2) furthest from v such that every endless path in 

V{v, diam — 2) passes through w 

9: If d{v,gv) < h — 1, or, V{v,diam — 2) contains a node w such that d{w,gv) > d{v,gy): 

10: i ^ h-1 

11: Else; 

12: i ^ h 

13: Vc ^ the node on path{v,gv) at distance i from v 

14: Output the sequence of outgoing ports of seq{v, Vc) 


Theorem 4.1 Algorithm EvenElect solves leader election in trees of size n and even diameter 
diam < D in time diam — 2 using 0{logD) bits of advice. 

Proof. We begin by proving the correctness of the algorithm. In particular, we must show that 
each node v correctly computes the central node Vc- If V{v,diam — 2) contains no endless paths 
starting at v, then V{v,diam — 2) consists of the entire tree. In this case, v can find the central 
node by inspection. Otherwise, it follows that d{v, Vc) G {h — l,h}. The following result shows that 
Vc always lies on the path from v to g^- 

Claim 4.1 If v is a node such that d{v,Vc) G {h — 1, h}, then Vc is on path{v,gv). 

To prove the claim, let Qi = path{v,Vc), and let Q 2 be any path starting from Vc of length h 
such that Q 2 and path{vc,v) are edge-disjoint. Let Q be the path Qi ■ Q 2 . Note that \Q\ > 
{h — 1) + h = 2h — 1 = diam — 1. In particular, in V{v, diam — 2), Q is an endless path starting 
at V. Therefore, by the definition of g^, node g^ is on path Q. If gv is in Q 2 , then Vc is on 
path{v,gv), and we are done. Finally, we show that gv cannot appear before Vc in Q, by way 
of contradiction. Assume it does. By the definition of gv, there must exist some endless path 
in V{v,diam — 2) that contains gv but not Vc- In particular, there must be a path Q' of length 
at least diam — 1 with v as one endpoint that passes through gv but not through Vc- Let v' be 






the endpoint of Q' not equal to v, and let w be the node in Q' that is furthest from v and on 
path{v,Vc)- We consider the length of the path Q" = path{v',w) ■path{w,Vc) ■ Q 2 - (See Figure 
l^for an illustration of the paths defined above.) Since \Q'\ > diam — 1 = 2/i — 1, it follows that 
\path{v,w)\ + \path{w^v')\ > 2 h — l. Also, note that \path{v,Vc)\ = \path{v,w)\ + \path{'w,Vc)\- So, 
\Q"\ = \path{v', rc)| + \path{w, Uc)| + IQ2I > (2h — 1 — \path{v, r(;)|) + {\path{v, Vc)\ — \path{v, tc)|) + 
h = (3/1 — 1) — d{v,w) + {d{v,Vc) — d{v,w)). Finally, since w appears before Vc in Q, we have 
d{v,w) < d{v,Vc) < h. Therefore, \Q''\ > (3/i — 1) — (h — 1) + 1 = 2/i + 1 > diam, a contradiction. 
This concludes the proof of Claim [Tlj 



Qi 


Q 


Q2 


Q’ " Q" 

iii 


Vc 


Figure 4: Paths Qi, Q 2 , Q, Q', Q" as defined in the proof of Claim 4T 
We now show that v correctly computes d{v, Vc) (which it stores in £) at lines 81(12 


Claim 4.2 Suppose that d{v,Vc) G {h — If d{v,gy) < h — 1, or, V{v,diam — 2) contains a 

node w such that d{w, g^) > d{v, g^), then d{v, Vc) = h — 1. 


To prove the claim, first suppose that d{v,gv) < h — 1. It follows from Claim 4.1 that d{v,Vc) < 


d{v, gv) < h — 1, which implies that d{v, Vc) = h — 1. Next, suppose that V{v, diam — 2) contains a 


node w such that d{w,gv) > d{v,gv)- It follows from Claim 4.1 that d{v,gy) = d{v,Vc) + d{vc, gv), 
so d{v,Vc) +d{vc,gv) < d{w,gv) < d{w,Vc) + d{vc,gv)- Hence, d{v,Vc) < d{w,Vc)- Note that, by 
the definition of the central node, we have d{w,Vc) < h. So, we have shown that d{v,Vc) < h — 1, 
which implies that d{v,Vc) = h — 1. This completes the proof of the claim. 


Claim 4.3 Suppose that d{v,Vc) G {h — 1, h}. If d{v,gv) > h — 1, and, for all w G V{v,D — 2), 
d{w,gv) < d{v,gy), then d{v,Vc) = h. 


We prove the contrapositive of this claim. Namely, we show that, if d{v,Vc) = h — 1 and, for all 
w G V{v,diam — 2), d{w,gv) < d{v,gv), then d{v,gv) < h — 1. Let be the hrst edge on the path 
from Vc to V. Let Ty be the subtree induced by all nodes reachable from Vy via a path starting with 
edge Cy. For each node w G Ty, we have path{w, gy) = path{w, Vy) ■ path{vc, gy) since Vy lies on the 
path from v to gy. Hence, d{w,Vc) = d{w, gy) — d{vc, gy) < d{v, gy) — d{vc, gy) = d{v,Vc) = h — 1, i.e., 
every path from Vy to a node in Ty has length at most h — 1. By the definition of the central node, 
there must exist two distinct edges incident to Vy that belong to paths Pi, P 2 of length h starting at 
Vy. Neither of these edges is equal to Cy, since Ty does not contain a path of length h starting at Vy. 
Let zi, Z 2 be the leaves of paths Pi, P 2 , respectively. Since d{v, zi) = d{v, Z 2 ) = d{v, Vc) + h = 2h — l, 
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it follows that path{v, zi) and path{v, Z 2 ) are endless paths in V {v, 2h—2) = V {v, diam—2). Rewrite 
path{v,zi) = path{v,Vc) ■path{vc,zi) and path{v,Z 2 ) = path{v,Vc) ■ path{vc, Z 2 ), and recall that 
the first edge of path{vc, zi) is not equal to the first edge of path{vc, ^ 2 )- By the definition of g^, it 
follows that Qy G path{v,Vc)- Therefore, d{v,gy) < h — 1, which proves the claim. 


By Claim 4.1, Vc is precisely the node at distance d{v,Vc) from v on path{v,gy). By Claims 4.2 


and 4.3, we see that the value of i computed in the algorithm is equal to d{v,Vc)- It follows that v 


correctly calculates Vc at line 13 


Finally, note that the advice consists of the value of diam, so the size of advice is O(logZl). □ 


Note. Theorem 4.1 implies that, for any tree T of even diameter diam, we have ^(T) < diam — 


4.2 Odd Diameter 

We now provide tight upper and lower bounds of 0(logn) on the minimum size of advice sufficient 
to perform leader election in time diam — 2, when diam is odd. Our lower bound is valid even for 
the class of trees with fixed diameter D. For the upper bound, we can provide the value of the 
diameter as part of the advice. 

Consider any tree T with n nodes and odd diameter D. We prove our upper bound by providing 
an algorithm OddElect that solves election in time D — 2 using O(logn) bits of advice. 

At a high level, our algorithm works as follows. Using D — 2 communication rounds, each node 
calculates its simple path to the closest endpoint of the central edge. For each node v, this closest 
endpoint will be called u’s candidate. The main difficulty of the algorithm is breaking symmetry 
between the two candidates. The advice helps the nodes decide which of the two possible candi¬ 
dates should be elected as leader, and provides the port number which leads from the non-elected 
candidate to the leader. To do this with a small number of bits, the advice succinctly describes a 
path which exists starting at one of the two endpoints of the central edge but not the other. The 
nodes that see this path starting from their candidate will elect their candidate, and the nodes that 
cannot see this path starting from their candidate will use the port number provided in the advice 
to elect the other candidate. 

We now provide the details of the advice and the algorithm. Let {cq, ci} be the central edge of the 
tree. For each node v, denote by cand{v) the node in {co,ci} that is closest to v. Let h = 

Note that, by the definition of the central edge, for each node v, we have d{v,cand{v)) < h. Recall 
that the gateway gy of a node v is defined as the node in V{v,D — 2) furthest from v such that 
every endless path in V{v, D — 2) passes through gy. 

We first construct the advice. The first part of the advice string is the exact value of D, which can 
be used to calculate the value of h. The goal of the rest of the advice construction is to succinctly 
describe a sequence of port numbers that distinguishes one of the two candidate nodes from the 
other. 

We divide the set of trees into two classes. We say that a tree is separated if, for each node v, 
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path{v,gy) contains the central edge. One bit of the advice string, called the “separated bit”, has 
value 1 if and only if T is separated. 

If T is separated, we construct the remainder of the advice string as follows. For each of the two 
endpoints co,ci of the central edge, we define the list Lj consisting of all port sequences that can 
be obtained by following simple paths starting at c* that do not contain the central edge. These 
port sequences consist of both the outgoing and incoming port numbers encountered, in order, on 
each path. Each list Lj is sorted in ascending lexicographic order. The following result shows that 
Lq and Li must differ. 

Claim 4.4 For any separated tree T, if Lq = L\, then T is symmetric. 

To prove the claim, it suffices to note that, if Lq = Li, then the subtrees rooted at cq and ci and 
resulting from the removal of the central edge are (port-preserving) isomorphic. It follows that the 
port numbers at the two endpoints of the central edge must be equal. Hence T is symmetric, which 
proves the claim. 

By Claim [T4l there exists a port sequence that appears in exactly one of Lq or Li. Formally, for 
some i G {0,1}, there is an integer j such that the sequence in Li does not appear in Ti-j. The 
remainder of the advice string is a tuple {j,k,m,p) where: 

• fc is the largest integer such that the sequences in Li and Li_j have equal prefixes of 
length k, 

• m is the integer equal to the {k + 1)*^ port number of the sequence in Li, and, 

• p is the port number that leads from ci_i to c*. 

We now describe the advice string in the case where T is not separated. The construction is similar 
to the case where T is separated, except for a change in the definition of the lists Lq and Li. In 
particular, for each of the two endpoints co,ci of the central edge, we define list Lj to be all port 
sequences of length at most 2h — l that can be obtained by following simple paths starting at Cj. As 
before, these port sequences consist of both the outgoing and incoming port numbers encountered, 
in order, on each path. Again, each list Lj is sorted in ascending lexicographic order. The following 
result shows that, also in the case of non-separated trees, lists Lq and Li must differ. 

Claim 4.5 For any tree T that is not separated, if Lq = L\, then ^{T) > D — 2. 

Our proof of the claim proceeds in three steps. First, we find two leaves wq,wi such that wq's 
candidate node is cq, wis candidate node is ci, and seq{wQ,CQ) = seq{wi, ci). We then show that 
V{wq,D — 2) = V{wi,D — 2). Finally, we show that this implies that f,{T) > D — 2 . In what 
follows, for any sequence s, we will denote by s the reverse of sequence s. 

Finding wq and wi. We first note that, since T is not separated, there must be at least two nodes, 
say a, j3, such that path{a, f3) does not use the central edge and has length 2h. It follows that a 


II 


and /3 have the same candidate node, which, without loss of generality, we assume is cq. Further, 
it follows that d{a, cq) = d{f3, cq) = h, and that the last port numbers in seq{a, cq) and seq{l3, cq) 
are different. Let pi be the port number at ci corresponding to the central edge. Let wq be a node 
in {a, 13} such that the last port number in seq{wo, cq) is not equal to pi. We now set out to find a 
node wi such that cand{wi) = ci and seq{wi^ci) = seq{wQ,co). Since d{co,wo) = h, it follows that 
Wo is a leaf, so seq{co,wo) is a sequence of length 2h with last port number equal to 0. Let a be the 
prefix of length 2/i — 1 of seq(co, wq). Since cr is a sequence of length 2h — 1 that can be obtained 
by following a simple path starting at cq, we know that a appears in Lq. Since Lq = Li, it follows 
that a also appears in Li. Let wi be the node that is reached by following the outgoing ports of a 
starting at ci. By our choice of wq, the first port number in a is not equal to pi, so path{ci,wi) 
does not use the central edge. It follows that cand{wi) = ci. Also, since wq and wi are leaves, it 
follows that seq{wo,co) = (0) • d = seq{wi,ci). This completes the first step of the proof. 

In what follows, let cq be the first edge on the path from cq to wq, and let ei be the first edge on 
the path from ci to wi. Let Tq be the subtree induced by all nodes that can be reached by a simple 
path starting with edge cq, and let Ti be the subtree induced by all nodes that can be reached by a 
simple path starting with edge ei. Since all nodes in Tq (respectively, Ti) are at distance at most h 
from Co (respectively, ci), the fact that Lq = Li implies that there is a port-preserving isomorphism 
between Tq and Ti. Since seq{wo,co) = seq{wi,ci), it follows that such an isomorphism maps wg 
to rci. 

Showing that V(wg, D — 2) CV (w^, D — 2) (a symmetric argument proves the reverse inclusion.) 
Since D — 2 = 2h — 1, it suffices to show that each sequence (p of at most Ah — 1 port numbers 
obtained by following a simple path starting at wg can also be obtained by following a simple 
path starting at wi. Let (p be any such sequence. We consider two cases. First, suppose that 
does not contain cg. Then P^ lies entirely within Tg. Since there is a port-preserving isomorphism 
between Tg and Ti that maps wg to wi, the same path exists in Ti starting at wi, as desired. 
Next, suppose that P^ does contain cg. We re-write cp = seq{wg^ cq) • (p' for some port sequence cp'. 
Since seq{wg,co) = seq{wi,ci) and |seg(rco, co)| = 2h, it follows that the first 2h ports of (p form 
seq{wi,ci). Further, since \(p\ < 4h — 1, it follows that \(p'\ < 2h — 1. Since cp' corresponds to a 
path starting at cg, the sequence cp' appears in Lg. Since Lg = Li, we know that cp' appears in Li, 
so (p' is a sequence of port numbers that can be obtained by following a simple path starting at ci. 
Hence, (p can be obtained by following a simple path starting at wi, as desired. This completes the 
second step of the proof. 

Showing that V{wg, D — 2) = V{wi, D — 2) implies that ^(T) > D — 2. To obtain a contradiction, 
suppose that V(wg, D — 2) = V(wi, D — 2) and assume that there is an algorithm that solves election 
in T within D — 2 rounds (with any amount of advice). For any such algorithm, nodes tco and wi 
output the same value since V{wg, D — 2) = V{wi, D — 2). In particular, they both output outgoing 
port sequences of equal length, say £. Since wg and wi have different candidate nodes, and both 
Wg and wi must elect the same node, it follows that at least one of the paths obtained by following 
their outputs must cross the central edge. Hence, £ > h. However, each of their outputs forms a 
path of length i ending at the elected node. These two paths combine to form a simple path of 
length 2i > 2{h + 1) > D, a. contradiction. This concludes the proof of the claim. 


12 



By Claim 4.5, lists Lq and Li must differ in the case that ^(T) <44 — 2, i.e., when leader election 
is possible in time D — 2. The remainder of the advice string consists of the tuple {j,k,m,p) as 
defined in the case of separated trees. This concludes the description of the advice. 


We now give the pseudo-code of the algorithm executed at an arbitrary node v in tree T using 
advice A. 


Algorithm 2 OddElect(A) 

1: e ^ 0 

2\ D diameter of T, as provided in A 
3: h^{D- l)/2 

4: Use D — 2 rounds of communication to learn V{v,D — 2) 

5: // Stage 1: compute cand{v) 

6: If V{v, D — 2) contains no endless paths starting at v: 

7: e •(— central edge of V{v, D — 2) 

8: cand{v) •(— endpoint of e closest to v 

9: Else: 

10: Qv the node w mV{v,D — 2) furthest from v such that every endless path in V{v,D — 2) 

passes through w 

11: If U(u,i4 — 2) contains a node w such that d{w,gv) > d{v,gv)- 

12: f ^ /l - 1 

13: Else: 

14: h 

15: cand{v) ^ the node on path{v, g^) at distance 4 from v 

16: // Stage 2: determine whether or not cand{v) should be elected as leader 

17: If the “separated bit” in A is 1: 

18: lfe = 0: 

19: e the edge incident to cand{v) that lies on all endless paths starting at v 

20: Compute seq{v,v') for each v' such that path{v,v') does not contain edge e 

21: Ly the lexicographically-ordered list of all such seq{v, v') 

22: Else: 

23: Compute every port sequence of length at most 2h — 1 corresponding to paths starting at 

cand{v) 

24: the lexicographically-ordered list of all such sequences 

25: Retrieve j, k, m from A 

26: If the {k + port number of the sequence in is equal to m: 

27: electMyCandidate •(— true 

28: Else: 

29: electMy Candidate ■(— false 

30: // Stage 3: compute output 
31: If electMy Candidate = true: 

32: Output the sequence of outgoing ports of seq{v,cand{v)) 

33: Else: 

34: Retrieve p from A 

35: Output the sequence of outgoing ports of seq{v,cand{v)) with p appended to the end 
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Theorem 4.2 Algorithm OddElect solves leader election in trees T with size n, odd diameter D 
and i{T) < D — 2, in time D — 2 using O(logn) bits of advice. 


Proof. We begin by proving the correctness of each stage of the algorithm. 

First stage. We must show that each node v correctly computes cand{v). If V{v,D — 2) contains 
no endless paths starting at v, then V{v,D — 2) consists of the entire tree. In this case, v can 
find the central edge by inspection and calculate cand{v) as the closest endpoint of the central 
edge. Otherwise, it follows that d{v, cand{v)) G {/i — 1, h}. The following result shows that cand{v) 
always lies on the path from v to gy. 


Claim 4.6 If v is a node such that d{v, cand{v)) G {h — 1, h}, then cand{v) is on path{v, gy). 


To prove the claim, assume, without loss of generality, that cand{v) = cq. Let Qi = path{v,co), 
and let Q 2 be any path starting from ci of length h that does not use the central edge. Let Q 
be the path Qi ■ (co,ci) • Q 2 . Note that \Q\ > {h — 1) + 1 + h = 2h = D — 1. In particular, in 
V{v,D — 2), Q is an endless path. Therefore, by the definition of gy, node gy is on path Q. If 
gy is in (co,ci) • Q 2 , then cq is on path{v,gy), and we are done. Finally, we show that gy cannot 
appear before cq in Q, by way of contradiction. Assume it does. By the definition of gy, there must 
exist some endless path in V{v,D — 2) that contains gy but not cq. In particular, there must be 
a path Q' of length at least D — 1 with v as one endpoint that passes through gy but not cq. Let 
v' be the endpoint of Q' not equal to v, and let w be the node in Q' that is furthest from v and 
on path{v, cq). We consider the length of the path Q" = path(v', w) ■ path{w, cq) • (cq, ci) ■ Q 2 . (See 
Figure]^ for an illustration of the paths defined above.) Since \Q'\ > D — 1 = 2h, it follows that 
\path{v,w) \ + \path{w,v')\ > 2h. Also, note that \path{v,co)\ = \path{v,w)\ + \path{w,co)\. So, 
\Q"\ > (2/i— \path{v, w)\) + {\path{v , co)| — \path{v, tc)!) +1 + /i = 3/i +1 + d(u, cand{v)) — 2 ■ d{v, w). 
Finally, since w appears before cq in Q, we have d{v,w) < d{v,co) < h. Therefore, \Q''\ > 


3h + 1 + (h — 1) — 2{h — l) = 2h + 2>D,a contradiction. This concludes the proof of Claim 4.6 


Qi 


iii 


Q 


Q2- 


- - . - ^ 

-Q' ! Q" - 


Co Cl 


Figure 5: Paths Qi,Q 2 ,Q,Q , Q as defined in the proof of Claim 4.6 


We will use the following claim to show that each v can determine its exact distance from cand{v). 


Claim 4.7 Consider any node v such that d{v,cand{v)) G {h — l,h}. There exists a node w in 
V{v, D — 2) such that d{w, gy) > d{v, gy) if and only if d{v, cand{v)) = h — 1. 
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We first prove the “ if” direction. Suppose that d{v, cand{v)) = h — 1. Let w he a node such that 
cand{w) = cand{v) and d{w, cand{w)) = h. The existence of w is guaranteed by the definition 
of the central edge. Thus, we have d{v,cand{v)) < d{w, cand{v)). Further, we have d{v,w) < 
d{v,cand{v)) + d{cand{v),w) = 2h — 1 = D — 2, so w is a node in V{v,D — 2). By Claim [4^ 
node cand{v) is on path{v,gy), so d{v,gy) = d{v,cand{v)) + d{cand{v), gy) < d{w,cand{v)) + 
d{cand{v), gy) = d{w,cand{w)) + d{cand{w), gy) < d{w,gy), as required. 


Next we prove the “only if” direction. Suppose that d{v,cand{v)) = h. Let w be any node in 
V{v,D — 2). In particular, this means that d{v,w) < D — 2 = 2h — 1. There are two cases 
to consider based on whether or not cand{v) = cand{w), i.e., whether or not v and w are on 
the same side of the central edge. In the first case, suppose that cand{v) = cand{w). It fol¬ 
lows that d{w, cand{v)) < h = d{v, cand{v)), so d{w,gy) < d{'w, cand{v)) + d{cand{v), gy) < 
d{v,cand{v)) + d{cand{v), gy) = d{v,gy) (where the last equality follows from Claim [T^ . Hence, 
d{v, gy) > d{w, gy), as required. In the second case, suppose that cand{v) / cand{w). It follows that 
path{v, cand{v)) and path{w, cand{v)) intersect only at node cand{v) since the shortest path from w 
to cand{v) passes through cand{w) first. Therefore, d{v, w) = d{v, cand{v))+d{cand{v),w). Since w 
is in V(v, D—2), it follows that d{v, w) < D—2 = 2h—l, so d(cand(v), w) = d{v, w)—d{v, cand{v)) < 
{2h — 1) — h = h — 1. Thus, d{w, gy) < d{w, cand{v)) + d{cand{v), gy) < {h — 1) + d{cand{v), gy) < 
d{v,cand{v)) + d{cand{v), gy) = d{v,gy) (where the last equality follows from Claim [T6| ) . Hence, 
d{v,gy) > d{w,gy), as required. This concludes the proof of the claim. 


By Claim 4.6, cand{v) is precisely the node at distance d{v,cand{v)) from v on path{v,gy). By 


Claim |4.7[ we see that the value of £ computed in the first stage of the algorithm is equal to 
d{v, cand{v)). It follows that v correctly calculates cand{v) during stage 1. 


Second stage. To prove the correctness of this stage of the algorithm, we show that if cand{v) = Ci 
for some i G {0,1}, then Ly = Lj (as defined in the advice construction). In the case where T is 
a separated tree, we see that the construction on lines |20| and |21| matches the definition of La, as 
long as e is the central edge. If the central edge was assigned to e during stage 1, then e is still the 


central edge at line 20 Otherwise, we must show that the central edge is assigned to e at line 19 
The following result confirms that this is the case. 


Claim 4.8 Suppose that T is separated and consider any node v such that V{v, D — 2) contains at 
least one endless path starting at v. The central edge ofT is the edge incident to cand{v) that lies 
on all endless paths starting at v. 


To prove the claim, recall that, since T is separated, path{v,gy) contains the central edge. Further, 
by the definition of gy, every endless path starting at v passes through gy. It follows that every 
endless path starting at v contains the central edge, which proves the claim. 


In the case where T is not a separated tree, it suffices to show that line 23 is possible to carry out, 
i.e., that each node v can compute every port sequence of length at most 2h — 1 corresponding to 
paths starting at cand(v). To see why this is the case, note that D — 2 = 2/i — 1, so each node v 
knows V{v,2h — 1). Since d{v,cand{v)) < h and v has computed cand{v), it follows that v also 
knows V{cand{v), h — 1). From V{cand{v), h — 1), v can compute all port sequences of length at 
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most 2h — 2 corresponding to paths of length h — 1 starting at cand{v), and, using the degrees of 
nodes at distance h — I from cand{v), v can compute all port sequences of length at most 2h — 1 
corresponding to paths starting at cand{v). This matches the definition of Lj in the case when the 
tree is not separated. Therefore, regardless of whether T is separated or not, the list is identical 
to the list Li corresponding to u’s candidate Cj, as produced in the advice construction. 

Third stage. To prove the correctness of this stage of the algorithm, we show that all nodes 
output a sequence of outgoing ports leading to the same node. Without loss of generality, assume 
that, in the advice construction, the {k + 1)*^ port of the sequence in Lq is equal to m, while 
the {k + 1)*^ port of the sequence in Li is not equal to m. Hence, p is defined as the port 
leading from ci to cq. We will show that the output of every node leads to node cq. We showed 
above that, in stage 2 of the algorithm, each node v with cand{v) = cq sets to Lq, so each such 
V sets electMyCandidate = true. Moreover, each node v with cand{v) = ci sets to Li, so each 
such V sets electMyCandidate = false. It follows that each node v with cand{v) = cq outputs a 
sequence of outgoing ports leading to cq, while each node v with cand{v) = ci outputs a sequence 
of outgoing ports leading to ci, with port p appended. However, p leads from ci to cq, so the output 
of each node v with cand{v) = ci leads to cq. This concludes the proof of correctness. 

Finally, note that the advice consists of 1 “separated bit”, the value of D, and four integers whose 
values are each bounded above by n. Thus, the size of advice is O(logn). □ 

We finish this section by proving the matching lower bound of ^(logn) on the minimum size of 
advice sufficient to perform leader election in time diam — 2 , for odd values of diam. As a first 
step, we analyze the following abstract pair breaking problem with parameter Z, where Z is a 
positive integer. Denote by X the set of pairs (a, b), where a,b G {1 ,... ,Z} and a < b. The set 
X is coloured with c colours by a colouring function C : X —)■ {1,... ,c}. Knowing Z and C, 
the goal is to find a function B : {1 ,..., Z} x {1,..., c} —)■ {0,1} with the following property: 
B{a,^) / B{b,^), where 7 = C{{a,b)). Such functions will be called symmetry breaking functions. 
What is the minimum integer c for which there exists a colouring function C : X —)■ {1,... ,c} 
such that this goal is attainable? 

We can interpret this problem as a game, in which for an instance (a, b) players get each one part 
of the instance (a or b), together with the colour of the instance, and each of them has to output 
“I” or “you”, in such a way that they agree on who is the winner. 

In what follows, we will consider the number of colours used by a fixed colouring function C on 
certain subsets of A. To this end, we define the following notation. For any S C {1,...,Z}, define 
Xs = {(O) ft) I a,b G S, a < b}, and define cs to be the number of different colours used by C on the 
elements of Xs- 

Lemma 4.1 Consider the pair breaking problem for parameter Z. Suppose that there exists a 
colouring function C : X —)■ {1,..., c} for which there exists a symmetry breaking function B. For 
any positive integer k >2 and any S' C {1,..., Z}, if |5| > |/c! + J2i=o (k-i)\ ’ 

Proof. We prove the result by induction on k. In the base case, i.e., when k = 2, suppose that 
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|5| > 3, and, to obtain a contradiction, assume that cs < 2. Then, for any a,b,c G S along with 
the single colour 7 used to colour Xs, we require all of the following: 


1. S(a,7)/S(6,7) 

2 . i 3 (a, 7 ) / i 3 (c, 7 ) 

3. ^( 6 , 7 ) / S(c, 7 ) 


From (2), we see that S(a, 7 ) / S(c, 7 ), so, by (3), it follows that 13{a, 7 ) = 13{b, 7 ). This contradicts 
(1), so our assumption that cs < 2 was incorrect. 

As induction hypothesis, assume that, for some fc > 2 , if I^I > ^kl + YliZo (k-iy. ’ then cs > k. 

We now consider any set S = {si,..., Sm} with m > |(/c + 1)! + Yli=o (i+i-l')! ’ prove that 
Cs > k + 1. To obtain a contradiction, assume that cs < k. 


Consider the subset of Xs consisting of (sj, Sm) for all i G {1,..., m — 1}. By the Pigeonhole Princi¬ 
ple, at least of these pairs are assigned the same colour a by C. Let S' = {sj | C((sj, Sm)) = a} 
and note that |5'| > 


cs 


We first show that cs' > k, and then we will prove that this leads to 


a contradiction. From our assumption that cs < k, note that |S"| > 

s„. , _ 1 = + 1), + j-J-i 


m—1 


> 


m—1 


> 


m—1 


Further 


m. - 1 > Uk + 1 )> + Ett riiSSi - 1 = i(t + 1)! + Eti AtA = i(t + 1 )! + Elt It 


i=0 (fc+l-i)! — 2 V^ w i;. -r (fc+i_i)! — 2 V''' w x;. w • 

follows that |5'| > > |A:! -|- J2i=o {k-iy • induction hypothesis, C 5 / > k. 


Finally, we show that cs' > k contradicts our assumption that cs < k. To do this, we first show 
that C((a, b)) 7 ^ a for every (a, b) G Xs'. Consider an arbitrary (a, b) G Xs', and let S" = {a, b, Sm}. 
Note that Xs" = {{a,b), {a,Sm), {b,Sm)}, and, from the base case above, note that cs" > 2 . Since 
a,b G S', it follows that {a,Sm) and {b,Sm) are assigned a by C. Therefore, the remaining element 
of Xs", i.e., (a,b), must be assigned a colour other than a, as desired. However, this implies that 
cs > k. Indeed, at least k colours, all different from a (which C uses to colour the pairs (s,, Sm) for 
each Si G S') are used to colour Xs' (see Figure]^ for an example.) This completes the induction. 
□ 


Corollary 4.1 For any colouring function C : X —)• {1,..., c} for which there exists a symmetry 
breaking function, we have c G Q.{-\/\ogZ). 


Proof. We apply Lemma 


4.1 


with S = {1,..., Z], with Z > 3. We set Z = '^k\ + 


^fc-3 fc! 


i =0 {k—i)\ 
k-3 k\ 


and 


find a lower bound for k in terms of Z. Clearly k > 2. Note that fc! < k^, and that {k-iy. ~ 

1 + k + ... + < (fc — 2)kf'~^ < k^. Hence, Z < 3fc^, so log^ < 3fclogfc < 3fc^. Therefore, 
c> k > \ \ log Z, as required. □ 


We will now use the above result on the pair breaking problem to obtain a lower bound on the size 
of advice for leader election in time D — 2, for trees with odd diameter D. To this end, we define 
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5i S2 S3 S4 S5 Se Sj Sg Sg 


Figure 6 : An illustration of the colouring argument in Lemma 4.1 In this example, S = {si,..., sg} 
and Xs is the section of the grid that lies above the dotted line. Assume that C has assigned an 
integer colour to every pair in Xs, and that the entries along row sg are coloured as indicated. 
Then, S' = {s 2 , S 4 , S 5 , ss}, and we have highlighted the corresponding rows and columns. The set 
Xs' consists of the entries where these rows and columns intersect, i.e., the union of the patterned 
entries. As demonstrated in the proof, for every (a,b) € Xs', entry ( 0 , 6 ) is not coloured a, since 
otherwise the three entries (a, b), (a, sg), and ( 6 , sg) would contradict the base case. Therefore, the 
total number of colours needed to colour Xs is at least one greater than the number needed to 
colour Xs'- 


the following class of trees of odd diameter D, called double brooms. These trees will have size m 
which satisfies 6{5 + 1) = (m — {D — 3) — 2)/2 for some integer 6. Consider any such integers m 
and (5. Let / : Z+ —> (Z^)*^ be any (computable) bijection from positive integers to d-tuples of 
positive integers. We define the double broom DB^{a, b) of size m, with positive integer parameters 
a < b < 5^ as follows. The handle of DB^{a, b) is a path of length D — A, with endpoints Va and 
Vb, such that the port sequence seq{va, Vb) = (0, 0,1,1,0,0,..., 1,1, 0, 0) is a palindrome. To each 
endpoint of the handle, attach the following tree of height 2 rooted at this endpoint. The first level 
of the tree Ta attached to Va consists of 5 + 1 nodes wi ,The port at Va corresponding to 
the edge {va, Wi} is i and the port at Wi corresponding to this edge is 0. Let /(a) = (ai,... as) and 
let 05+1 = {m — {D — 3) — 2)/2 — 6 — Yli=i (The term 05+1 is defined in this way to ensure that 
the entire double broom has size exactly m.) For each i G {!,...,<5 + 1}, attach ai leaves to node 
Wi. The tree Tb attached to the endpoint Vb is defined analogously. This concludes the description 
of the double broom DBs{a, b), see Fig. 

The following lemma gives a reduction from the pair breaking problem to leader election in time 
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Figure 7: Double broom DB^{a,h) 


D — 2 for double brooms. 


Lemma 4.2 Suppose that there exists an algorithm ELECT solving leader eleetion in time D — 2 
for the class of double brooms DBs{a,b) with odd diameter D, fixed 5, and all positive integer 
parameters a <b <5^, which uses advice of size o(loglog(5^). Then, for the pair breaking problem 
with parameter Z = 5^, there exists a colouring function that uses o{^/log Z) colours for which there 
exists a symmetry breaking function. 

Proof. We define a function E which maps instances of the pair breaking problem to double 
brooms. More specifically, E maps each pair (a, 6) to the double broom DBs{a,b). Let A be 
the advice function for algorithm ELECT that maps double brooms DBs{a,b) to binary advice 
strings. Let A be the the range of A, i.e., the set of all advice strings needed by the algorithm. Let 
g : A —7- {1,..., |yl|} be any bijection from binary strings to positive integers. 

For the pair breaking problem with parameter Z, we define the colouring function C : X —)■ 
{1,..., c} that takes each instance (a, b) of the pair breaking problem and maps it to g{A{E{a, b))). 
Intuitively, the colour of an instance (a, b) is set to the advice given for the corresponding dou¬ 
ble broom. We now show that c G o(\/log Z). Since the size of advice for algorithm ELECT 
is o(loglog(I^), it follows that every advice string provided to the algorithm has size less than 
I log log (5'^, for sufficiently large 5. It follows that the number of different advice strings needed 
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by the algorithm is at most (log(5*^)^/^. Therefore, the range of g has size at most (log G 
o(-v/log Z), as required. 

Next, we show that there is a symmetry breaking function B that uses this colouring function. 
For any integer 2 ; G {1,..., Z}, we map 2 ; to a double broom with one parameter equal to 2 ; and 
the other parameter equal to the smallest positive integer not equal to 2 ;. In particular, let G be 
a function such that G(l) = DBs{l,2) and G{z) = DBs{l,z) for all 2 ; G {2, ...,Z}. Define the 
binary function B as follows. It takes integer inputs 2 ;, 7 , runs the ELECT algorithm on G(z) with 
advice and outputs 0 if and only if the elected node is within distance {D — 5)/2 from 

(i.e., closer to the endpoint of the handle to which is attached). We now prove that B is indeed 
a symmetry breaking function. For any instance (a, b) of the pair breaking problem, consider the 
values of B{a,C{a, b)) and B{b,C{a, b)). First, note that g~^{C{a, b)) is the advice string, say s, that 
is provided to the ELEGT algorithm for the tree DBs{a,b). So, on the input pair (a, C(a, 6 )), our 
function B runs ELEGT on G{a) with advice s. Note that, in the construction of both G{a) and 
DBg(a, b), the same tree Ta is attached to node Va- Since the ELEGT algorithm uses time D — 2, it 
follows that, for any leaf v in Ta, we have VQ(^a) L> — 2) = VDBg(a,b) D — 2) (since the handle has 
length D — A and Ta has height 2). Hence, v elects the same node in G{a) as it does when ELEGT 
is executed on DBs{a,b). Similarly, on the input pair {b,C{a,b)), the function B runs ELEGT 
on G{b) with advice s, and elects the same node in G{b) as it does when ELECT is executed 
on DBs{a,b). However, for any node w in DBj{a,b), exactly one of Va or is within distance 
[D — 5)/2 from w. Therefore, B{a,C{a,b)) ^ B{b,C{a,b)), and hence B is symmetry breaking, as 
required. □ 


Corollary 4.1 and Lemma 4.2 imply the following theorem. 


Theorem 4.3 Let D < n be positive integers, where D is odd. There exists a class T of trees T 
with size 0(n), diameter D and ^(T) < D — 2, such that every leader election algorithm working 
in time D — 2 on the class T requires advice of size H(logn). 

Proof. Choose integers m and 5, such that 6{6 + I) = (m — (D — 3) — 2)/2 and 6 G Q{y/n). Hence 
m G 0(n). Let T be the class of double brooms DBs{a,b), for all positive integers a < b < 5^. 
Corollary |4.1| and Lemma [4.2| imply that the size of advice required by any leader election algorithm 
on this class is H(loglog(5‘^) C H(logn). 

For each node v in any DBs{a, b), the view yDBs(a,b){'^i D — 2) includes both Va and Vb, as well as 
at least one of the subtrees Ta or Tj,. Therefore, given the entire map of DBs{a,b), node v can 
locate itself on the map. Further, node v can compute values of a and b and output the sequence 
of outgoing ports of seq{v, v^), where g = min{a, b}. Hence (,{T) < D — 2. □ 


Theorems 4.2 and 4.3 imply the following corollary. 


Corollary 4.2 The minimum size of advice sufficient to do leader election in time D — 2 in the 
class of Q{n)-node trees T with odd diameter D and f,(T) < D — 2 is 0(logn). 
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5 Time (5 ■ diam <r< diam — 3, for any constant jd > 1/2 


For the time interval [/3 • diam, diam — 3], for any constant fd > 1/2, we provide upper and lower 
bounds on the minimum size of advice sufficient to perform leader election. Our bounds are 
separated by a gap of only O(logn), except for the special case when diam is even and time is 
diam — 3. 


5.1 Upper bound 


Consider a tree T with diameter diam = D, and let h 


[D/2\. Let e = ^ and note that 


At a high level, our algorithm first partitions the set of leaves into k classes such that leaves in the 
same class belong to the same subtree of height [tD\ — 1. Let Ri,..., be the roots of these 
subtrees of height [eD\ — 1. Next, each node in T chooses one of these Ri as its representative. The 
advice provided to the algorithm consists of k pieces, one piece for each Ri G {Ri,..., R^}. Each 
piece instructs how to reach the leader starting at node Ri. Therefore, to solve leader election, each 
node V can compute a path to the leader using the path to its representative, along with the advice. 
The main difficulty in designing the algorithm is to ensure that each node finds its representative 
and determines which piece of advice corresponds to it. 


In what follows, we will make use of an injective function F that maps rooted trees with at most 
n nodes to binary strings of fixed length in 0{n). (One example of such a function is discussed in 
Section]^) We will apply this function to views of nodes in order to distinguish them. When F is 
applied to a node’s full view of the tree (i.e., to V{v,n)), then the resulting binary string will be 
called the node’s signature. Given two distinct nodes v,w € T, we will say that v has a smaller 
signature than w if F{V[v,n)) is lexicographically smaller than F(V{w,n)). 

To aid in the description and analysis of our algorithm, we carefully choose a node c as the root of 
T. This is the node that the algorithm will elect. In the case where D is even, the central node of 
T is chosen as the root. In the case where D is odd, the node on the central edge of T that has 
the smaller signature is chosen as the root. In what follows, the depth of a node is defined as its 
distance from the root c. 


The Representatives. We define the representatives of an arbitrary tree K using the following 
greedy subroutine that takes as input the map of K and a designated root r on the map. 
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Algorithm 3 ComputeReps(A, r) 

1: i ^ 0 

2 : UncoveredLeaves ■(— set of all leaves of K that have depth at least [e-Dj — 1 
3: While Uncover edLeaves ^ 0 
4: i i + 1 

5: Xi ^ deepest leaf in UncoveredLeaves 

6 : Vi •(— ancestor of Xi at distance YeD\ — 1 

7: Remove from Uncover edLeaves any descendants of Vi 

8: Output (ri,..., Vi) 


Since the size of UncoveredLeaves decreases by at least one in every iteration of the while loop, 
there exists some positive integer k such that the procedure terminates after k iterations of the 
while loop. Note that path{xi,ri),... ,path{xk, r^) are k disjoint paths, each containing YeD\ nodes 
of K. It follows that k £ 0{\K\/D). 


The Advice. We now describe the advice provided to the algorithm for tree T. First, the oracle 
computes the representatives Ri,...,Rk of T by executing ComputeReps(r, c). Then, for each 
i £ {1,... ,/c}, the oracle computes a list Li consisting of the sequences seq{Ri,w) for every iv at 
distance at most h from R^. Each Li is sorted lexicographically, and Zi is defined to be the index 
of seq{Ri, c) in this list. The purpose of the integers zi,..., Zk is to enable each Ri (and each node 
that has representative Ri) to compute the path from Ri to c. However, we cannot assume that 
nodes know the index of their representative, i.e., nodes may not know which Zi should be used to 
compute the path from their representative to c. To remedy this, the oracle includes in the advice 
a trie [ 2 ] that enables each node to determine which piece Zi of advice is intended for it to use. 
More specifically, the oracle first computes a list S consisting of F{V{Ri, h)) for each i £ {1,..., A:}. 
Then, a trie is computed for S using the following recursive procedure BuildTrie. 


Algorithm 4 BuildTrie(S') 

1 : If S contains only one string s = F{V{Ri, h)) for some i £ {1,..., k}: 

2: Return a single node labeled Zi 

3: Else: 

4: j £- the largest index such that all strings in S have the same prefix of length j 

5: Sq £- the list of strings in S that have a 0 at index j' + 1, each with prefix of length j + 1 

removed 

6 : Si £- the list of strings in S that have a 1 at index j + 1, each with prefix of length j' + 1 

removed 

7: Return a node labeled j with left child equal to BuildTrie(S'o) and with right child equal 

to BuildTrie(S'i) 


The advice provided to the algorithm is the value of D, the value of the allocated time r, as well 
as the trie computed by BuildTrie(S'). It remains to show that the advice is well-defined, i.e., 
that BuildTrie(S') terminates. This is the case if and only if the strings in S are all distinct. So, 
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the following lemma proves that BuildTrie(S') terminates for all trees in which leader election is 
solvable in time r. 

Lemma 5.1 If^{T) < r then, for all distinct i,j G {1,..., /c}, we have F{V{Ri, t)) FiV{Rj,T)). 

Proof. Suppose that there is an algorithm ELECT that solves leader election in time r (with any 
amount of advice). To obtain a contradiction, assume that, for some distinct i,j G {1,..., /c}, we 
have E{V{Ri,T)) = E{V{Rj,T)). Since E is injective, it follows that V{Ri,T) = V{Rj,T). This 
implies that nodes Ri and Rj output the same value in the execution of ELECT on T. Let Pi 
be the simple path in T starting at Ri that corresponds to Rfs output, and let Pj be the simple 
path in T starting at Rj that corresponds to Rj’s output. Note that \Pi\ = \Pj\. Since Ri and Rj 
must elect the same node, there exists a unique node v such that the paths Pi and Pj first intersect 
at V. Since T is a tree, we can re-write Pi = path{Ri,v) ■ Q and Pj = path{Rj,v) ■ Q for some 
(possibly empty) simple path Q. Since \Pi\ = \Pj\, it follows that \path{Ri,v)\ = \path{Rj,v)\. We 
now observe that \path{Ri,v)\ and \path{Rj,v) \ are bounded above by h. Indeed, if both paths had 
length greater than h, then their union would be a simple path of length at least 2h + 2 > D, a 
contradiction. So, path{Ri, v) and path{Rj,v) are edge disjoint paths of length at most h. However, 
this means that the port at v corresponding to the final edge of path{Ri,v) is different than the 
port at V corresponding to the final edge of path{Rj,v). It follows that seq{Ri,v) / seq{Rj,v). 
This implies that V{Ri,h) / V{Rj,h), and, since h < r, it follows that V{Ri,T) / V{Rj,T), a 
contradiction. □ 

We also provide a retrieval procedure Retrieve(Ti?, s) which takes as input a trie TR and a string 
s. If s belongs to the set S used in the construction of the trie, then the procedure returns the 
value stored in the label of the leaf node corresponding to the string s. In our case, each s is some 
F{V{Ri, h)), and the value stored in the corresponding leaf is Zi. 


Algorithm 5 Retrieve{TR, s) 

1: root G- root of TR 

2: If TR consists of a single node: 

3: Return label of root 

4: Else: 

5: j G- label of root 

6: If {j + 1)*^ bit of s is 0: 

7: TR' G- left subtree of root 

8: Else: 

9: TR' G- right subtree of root 

10: s' G- string s with prefix of length j + f removed 

11: Return Retrieve(rii', s') 


The Algorithm. We now dehne our leader election algorithm ElectWithTrie executed by each 
node u in T given an advice string A. We start by defining a procedure FindRep. For each node v 
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with sufficiently large depth, FindRep computes the representative of v given its view V{v,t) and 
the values of e and D. (Nodes with small depth will not need representatives to perform election.) 
At a high level, the procedure picks an ancestor w of v, finds the subtree rooted at w consisting of 
all of re’s descendants, then executes ComputeReps on this tree. Of the representatives returned by 
ComputeReps, v picks one that is either its descendant or ancestor. 


Algorithm 6 FindRep(u, y(u, r), e, D) 

1: w -^r- the node in V{v,t) at distance [eD\ — 1 from v such that every endless path in V{v,t) 
starting at v passes through w 

2 : upw ^ the edge incident to w s.t. every endless path starting at v passing through w uses upu, 
3: Tw <— subtree of V {v, r) induced by all nodes x such that path{x, v) does not pass through upu, 
4: R ComputeReps(Tu,, re) 

5: i any leaf in V{v, r) such that v lies on path{i, w) 

6: Return any node r in i? such that r lies on path{i, w) 


Below is the pseudocode of the algorithm ElectWithTrie that is executed by each node v in T. 


Algorithm 7 ElectWithTrie(A) 


1 : 

2 : 

3 

4: 

5: 

6 : 

7: 

8 : 

9: 

10 : 

11 : 

12 : 

13: 

14: 

15: 

16: 

17: 

18: 

19: 

20 : 

21 : 


D ^ diameter of T provided in A 
h ^ [D/2\ 

T <— the value of allowed time provided in A 
Use r rounds to learn V{v,t) 

If U(u, r) contains no endless paths starting at v: 

If D is even: 

c ^ the central node of V {v, r) 

Else: 

c the node on the central edge of V{v, r) with smaller signature 
Output sequence of outgoing ports in seq{v, c) 

Else: 

T 1 


D 2 

FindRep(u, V(v, r), e, D) 

F(V(r,h)) 

the trie provided in A 
Retrieve(s, TR) 

lexicographically-ordered list of port sequences seq{r, w) where d{r, w) < h 
path corresponding to sequence in L > path from r to c 

- the walk in T consisting of path{v, r) followed by P 
simple path from u to c obtained from W by removing any non-simple subwalk 


Theorem 5.1 Consider any fixed D, fi such that /3 > 1/2 and (3D < D — 3. For r G [/3D, D — 3], 
Algorithm ElectWithTrie solves leader election in trees T with size n, diameter D and C(T) < t, 
in time r using bits of advice. 
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Proof. For any node v, denote by depth{v) the depth of v with respect to c. To prove the 
correctness of our algorithm, we show that every node elects c. We first consider the nodes that 
have small depth. 


Claim 5.1 For any node v with depth less than \_eD\, v elects c at line 10 


To prove the claim, note that r > \D + \eD\ > + {depth{v) + 1) > /i + 1 + depth{v). Since 

the distance from c to any node in T is at most h + 1, it follows that the distance from v to any 
node in T is at most h + l + depth{v). Therefore, V{v, r) is equal to T. Hence, V{v, r) contains no 
endless paths starting at v, and, at linef^ v elects c. This proves the claim. 


In what follows, we consider the nodes with large depth, i.e., nodes v with depth{v) > \_eD\. We 
first show that each such node picks one of the representatives Ri,..., Rk at line [131 


Claim 5.2 FindRep(n, H(n, r), e, Z?) outputs a node in {Ri,..., Rk}- 


To prove the claim, we show that the set R computed at line of Algorithm FindRep is a subset of 
{Z2i,..., Rk}, i-e., a subset of the output of ConiputeReps(T, c). The proof proceeds in three steps. 
First, we prove that, in the execution of Algorithm FindRep(n, H(n, r), e, H), w is an ancestor 
of n in T such that w has at least one descendant in T at distance [eHj — 1. Next, we show 
that Tw is induced by the descendants of w. Finally, we show that, for such w and T^, we have 
ComputeReps(T^, u)) C ComputeReps(T, c). 

Showing that w is an ancestor of v and w has at least one descendant in T at distance 

[eD\ — 1. We consider w's definition at line of FindRep. Recall that v has depth at least [eD\ 
in T, so V has an ancestor a at distance [eD\ — 1. At least one endless path in V{v,t) starting 
at V passes through a since there is at least one endless path starting at v that passes through c 
(which is an ancestor of a). Hence, a is a node that satisfies the definition on line[^ Moreover, a 
is the only such node, since we can show that every path in V{v,t) starting at v that avoids a is 
a terminated path. Indeed, consider any path from n to a leaf b such that the path avoids a. The 
distance from v to the penultimate node on path{v, a) is [eD\ — 2 and the distance from this node 
to any of its descendants is at most h + 1. Therefore, d{v, b) < h + [eD\ — 1 < (^ + e)D < r. Thus, 
we have shown that a is the unique node that satisfies the definition of rc, so u) is an ancestor of v. 
Also, by definition, d{v, w) = [eiZj — 1, so re has at least one descendant in T at distance [eD\ — 1. 

Showing that is induced by the descendants of w in T. We show that upw is the edge 
between w and w’s parent. As we observed above, there is at least one endless path in V{v,t) 
starting at v that passes through w, so there is at least one edge that satisfies the definition of upw 
Moreover, since the distance from w to any of its descendants b is at most h + 1, and the distance 
from u to tc is [eD\ — 1, it follows that d{v,b) < h + [eD\ < (^ + e)D < r, so no endless path 
starting at v has descendants of w as both of its endpoints. It follows that upu, cannot be an edge 
on a path from re to a descendant of w, and, that every descendant of u) is in H {v, r). This implies 
that Tw is induced by the descendants of w in T. 
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Showing that ComputeReps(r^, w) C ComputeReps(T, c). The proof is by contradiction. First 
note that, for any node x in d{x, w) is bounded above by d{x, c). Next, to obtain a contradiction, 
consider the first iteration j of ComputeReps(T^, re) during which a node rj is chosen as one of 
the outputs and Vj 0 {i2i,..., By the speciheation of ComputeReps, rj was added to the 

output because it was the ancestor of the deepest node Xj in UncoveredLeaves, and, further, 
we have d{rj,Xj) = \_eD\ — 1. Note that d{xj,w) > [eD\ — 1, so d{xj,c) > [eD\ — 1. We 
now consider the execution of ComputeReps(r, c). Since d{xj,c) > [eT>J — 1, it follows that Xj is 
initially in Uncover edLeaves. This means that, in some iteration of the while loop, Xj is removed 
from Uncover edLeaves as the descendant of some chosen Ri ^ Vj. Since Ri ^ rj and both 
Vj and Ri are ancestors of xj, it follows that either Ri is an ancestor of rj or vice-versa. We 
now show that Ri is not an ancestor of Vj. If this were the case, then d{Ri,Xj) > [eD\ — 1. 
In particular, Xj would be a leaf in T such that Xj’s distance to Ri is greater than [eDj — 1, 
which contradicts the choice of Ri as the ancestor at distance exactly [eD\ — 1 from the deepest 
leaf Xi in Uncover edLeaves. So, we have that Vj is an ancestor of Ri. However, this means 
that d{rj,Xi) > d{Ri,Xi) = [eDj — 1 = d{rj,Xj), which we will use later to obtain the desired 
contradiction. We now reconsider the iteration of execution ComputeReps(Ti„, re). Note that 
Ri was not added to the output before this iteration (since, as Ri is an ancestor of Xj, this would 
imply that Xj was already removed from UncoveredLeaves, contradicting our choice of xj.) By 
assumption, rj G {.Ri, • • •, Rk} in all iterations i < j, so it follows that Xi is in UncoveredLeaves at 
the start of iteration j. But, recall that d{rj,Xi) > d{rj,Xj), so we have a leaf with depth greater 
than Xj in UncoveredLeaves at the start of iteration j, which contradicts the definition of Xj. This 
concludes the proof that ComputeReps(T^, tc) C ComputeReps(T, c), which completes the proof of 
the claim. 


Let Ri be the representative picked by v. It remains to show that v computes seq{Ri,c) in lines 
Tips of ElectWithTrie. First, we show that v is able to compute a sufficiently large part of Rj’s 


view. 


Claim 5.3 V{Ri,h) C V{v,t) 


To prove the claim, we first show that v is either a descendant of, an ancestor of, or equal to Ri. 
This is the case since, by lines and of FindRep, there is a leaf i such that v is an ancestor of i, 
and Ri lies on a path from i to an ancestor of v. 

Next, we show that V{Ri, h) C V{v,t) regardless of whether n is a descendant or ancestor of Ri. 
(The claim is obvious for v = Ri.) If u is a descendant of Ri, we note that, by the definition of 
Ri in ComputeReps, the distance from Ri to any of its descendants is at most [eD\. It follows 
that r = {^D + e)D > h + d(v,Ri), so V{v,t) contains V{Ri,h), as desired. If v is an ancestor 
of Ri, we note that V{v,h) contains all descendants of v. The only nodes in V{Ri,h) that are not 
descendants of v are contained in V{v,h — d{Ri, v)) C V (v, h), as desired. This completes the proof 
of the claim. 

We now prove that v correctly computes the sequence of ports from its representative to c. 
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Claim 5.4 At line 


18 


the sequence in L is equal to seq{Ri,c). 


To prove the claim, note that, in the advice construction, the {ziY^ sequence in Lj is equal to 
seq{Ri, c). We show that z = zi and L = Li. 


To prove that 2 : = Zj, note that 2 is assigned the output of Retrieve(s, Ti?), where s = F{V{Ri, h)) 
is one of the strings in S used to build TR. It follows that Retrieve(s, Ti?) returns 2 *. 


To prove that L = Li, note that, on line L is defined as the lexicographically-sorted list of 
sequences seq{r,w) for all w with d{r,w) < h. Since r = Ri, this matches the definition of Li. 


Further, by Claim 5.3, we have that V{Ri,h) C V{v,t), so the computation of L can indeed be 


carried out by v. This concludes the proof of the claim. 


By Claims 5.2 and 5.4 every node with depth at least \_eD\ chooses a representative Ri within its 


view and computes a path from to c. Hence it computes a path from itself to c. This concludes 
the proof of correctness. 


Finally, we consider the size of the advice. In the advice construction, the list S consists of k 
strings (one for each representative), and these strings have some fixed length, say A, in 0{n) (by 
our choice of F.) Consider the trie TR constructed by BuildTrie(S'), as described in Algorithm]^ 


Claim 5.5 The number of leaves inTR is at most k. 


To prove the claim, we provide a one-to-one correspondence / from the leaves of Ti? to the strings 
in S. First, for each node w £ TR, let Sw F S be the list of strings that was provided as the 
parameter to BuildTrie in the execution where w was created. Let jw be the label of node w, and 
let prew be the common prefix of length jw of all strings in Sw (if Sw contains only one string s, then 
prew is defined to be s.) Next, for an arbitrary leaf x £ T, let (xi,..., Xm) be the root-to-leaf path 
of vertices ending at x (i.e., xi is the root of TR and Xm = x.) For every a G {1,..., m — 1}, we 
define bx,a to be 0 if Xj+i is the left child of Xj, and 1 if Xj+i is the right child of Xj. Finally, we define 
/(x) = prcx^ ■ bx,i ■prex 2 ' ■ ■ ■ bx,m-i -pxex^, where • is the string concatenation operator. To see 

why / is one-to-one, consider any distinct leaves x,y £ TR, and let Xa = Va = z he their deepest 
common ancestor. By the maximality of a, leaves x and y are descendants of different children of 
2 , so we have bx,c Y ^y,c- It follows that the bit at position (j.^ -|- 1) -|- {jy^ -|- 1) {jy^ -|- I) in 

/(x) differs from the bit at the same position in f{y), so /(x) Y f{v)- This concludes the proof of 
the claim. 


Claim 5.6 The number of nodes in TR is at most 2k. 


To prove the claim, observe that every node in the trie is either a leaf or has two children. Consider 
the mapping g that maps each internal node w to the rightmost leaf in w's left subtree. Since g is 
one-to-one, we get that the number of internal nodes is bounded above by the number of leaves. 
Therefore, by Claim 5.5, the number of nodes in Ti? is at most 2k, which proves the claim. 
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Claim 5.7 The label of each node inTR has size O(logn). 


To prove the claim, note that each internal node of Ti? is labeled with an integer corresponding 
to an index within a string of length at most A G 0(n). Therefore, the size of each such label is 
O(logn). Next, each leaf is labeled with an integer zt corresponding to an index within a list of 
sequences, all of which correspond to simple paths in T originating at representative Ri. Since the 
number of such simple paths is bounded above by n — 1 (one for each node in T other than Ri) it 
follows that Zi < n — 1. Therefore, the size of zt is O(logn). This concludes the proof of the claim. 


By Claims 5.6 and 5.7, the total number of bits needed to represent TR is 0{k\ogn). Recall that 
k G 0{n/D), so TR can be represented using bits. Since providing the diameter of T and 

the value of r require only O(logn) additional bits, we are done. □ 


5.2 Lower bound 

The lower bound holds even for a slightly larger time interval than we need, namely starting from 
[diam/2\. We split the argument into two cases: when the diameter is odd and when it is even. 

5.2.1 Odd diameter 

Theorem 5.2 Let 7 < D < n be positive integers, where D is odd. Fix any value [yj < t < D — 3. 
There exists a class T of trees T with size Q{n), diameter D, and f,{T) < r, such that every leader 
election algorithm working in time r on the class T requires advice of size Q{n/D). 

Proof. Let h = and let fc = [";§]• Let m = Dk + 2 G 0(n). We define a class of trees T with 
size m G 0(n), odd diameter D, and f,{T) < such that the minimum size of advice needed by 
an arbitrary algorithm ELECT solving leader election in time r for this class is Ll(n/D). 

We start with a single tree G of size m, defined as follows. The edge {co,ci} is the central edge 
of G, and the port numbers corresponding to this edge at cq and ci are both 0. Next, for each 
i G {1,... ,k}, there is a path Pi of length h — 2 with cq as one endpoint. The other endpoint 
of each of these paths will be denoted by pi. Further, the port sequence seq{co,pi) is equal to 
{i, 0,1, 0,1, 0,..., 1,0). The same paths appear with ci as one endpoint, and, for each z G {1,..., k}, 
we will refer to each of these paths, and their corresponding endpoint other than ci, as Qi and qi, 
respectively. The subtree of G described so far is denoted by H. Finally, for each z G {1,..., k}, 
there is a tree with root pi, where Tq,* is a path of length 2. The port sequence from each pi 
to the other endpoint of is (1,0,1,0). Further, for each z G {1,... ,k}, there is a tree Ti^j of 
height 2 with qi as the root. More specifically, Ti^j is a path of length 2 with an additional edge 
incident to the middle node. The port sequences from each qi to the leaves of Ti^j are (1,0,1,0) 
and (1,0, 2,0). This completes the definition of G. Figure [^illustrates tree G. 

Next, for every subset u of {2,..., k}, we define a tree G^- At a high level, G^ is obtained from G 
by swapping the subtrees rooted at pi and qi, for each i ^ a. More specifically, the definition of Go- 
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Figure 8 : Tree G constructed in the proof of Theorem 5.2 with k = A. 


is similar to the definition of G above, except that, for each i € a, tree has as its root and 
tree Ti^j has pi as its root. See Figure]^ for an example of G^- Note that G 0 = G. Further, for any 
a 7 ^ a', we have G^ / Go-'. However, note that for any cr, since the differences between G and Go 
are only at the leaves or neighbours of leaves, we have that the subtree H oi G is also a subtree 
of Go. The following result about H follows from the symmetry of H with respect to the central 
edge and from the fact that pi and qi are images of each other under this symmetry. 


Fact 5.1 For any i G {2,..., k}, the views VniPi, D — 5) and Vniqi, D — 5) are identical. 

The class T is defined as the class of trees Go for all subsets a of { 2 ,..., A:}. We now set out to 
prove a lower bound on the number of different advice strings needed to solve leader election for 
all trees in T. 


Claim 5.8 For any a 7 ^ a', the advice strings provided to algorithm ELEGT for trees Go and G^f 
must be different. . 


We prove the claim by contradiction. Assume that the advice strings assigned to Go and Go' 
are the same. Since a a', without loss of generality, assume that there is an integer i € a 
such that i 0 a'. Consider a leaf v of tree Ti^j, which, in Go, is rooted at node pi. Note that, 
Vg^(v, D — 3) = Vti i(v, 2)UVff(pi, D — 5). In Go', tree Ti^j is rooted at node qi, so Vg^, (v, D — 3) = 
Vti i(v, 2')UVH(qi, D — 5). By Fact 5.1 it follows that Vg„{v, D — 3) = Vg^i D — 3). So, v outputs 
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Figure 9: Tree constructed in the proof of Theorem 5.2, with k = A and a = {2, 3}. 


the same sequence of outgoing ports after executing ELECT in both and G^'- Assume, without 
loss of generality, that in G^, the node elected by v after executing ELECT is closer to cq than 
to Cl- Since the length of u’s output is the same for both G^ and Go-', the node elected by v after 
executing ELECT in Ga' is closer to ci than to cq. However, in both G^ and Go', tree Toq is 
rooted at node p*, and the leaf v' of tree Toq outputs the same sequence of outgoing ports. Hence, 
v' elects the same node in both Go and Go'. Thus, in at least one of Go or Go', nodes v and E do 
not elect the same node, a contradiction. This concludes the proof of the claim. 

Since there are different subsets of {2 ,..., fc}, the number of different advice strings is at least 
2^~^. It follows that the size of advice is Ll{k) = Ll{n/D). 

Finally, for any a, we prove that ■^(Go) < h, which implies that ^(Go) < t . It is sufficient to show 
that, using h rounds of communication, an arbitrary node v given a map of Go can compute where 
it is located in the map. The distance from any node u to a node on the central edge is at most 
h. Let c{v) be the endpoint of the central edge that is closest to v. It follows that V{v, h) contains 
c{v) (which can be identihed by finding the closest node to v that has degree A: + 1). Consider two 
cases. If u = c(u), then the subtree of V{v,h) induced by the nodes that can be reached from v 
via a path starting with port 1 can be used to uniquely identify whether u = cq or u = ci. Indeed, 
Toq is rooted at the node at distance h — 2 from v in this subtree if and only if u = cq. If u 7^ c(u), 
let i be the hnal port number in the port sequence seq{v, c{v)). The subtree of V{c{v),h) induced 
by the nodes that can be reached from c{v) via a path starting with port i either has or Ti^j 
rooted at the node at distance h — 2 from c{v). By identifying which of these two trees appears, v 
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can identify its position on a map of G^- 


□ 


5.2.2 Even diameter 

The lower bound argument for even diameter closely resembles that for odd diameter, as given 
in the previous theorem. However, in this case, it holds only for t < D — A. At a high level, 
we construct trees of even diameter by increasing by 1 the diameter of trees constructed in the 
previous case. On the other hand, we decrease the time by 1, so that the views of certain nodes do 
not change. Rather than providing a list of small but numerous changes in the proof, we give the 
entire modified construction and argument for the reader’s convenience. 


Theorem 5.3 Let 8 < D < n be positive integers, where D is even. Fix any value D/2 < r < D—A. 
There exists a class T of trees T with size Q{n), diameter D, and f,{T) < t, such that every leader 
election algorithm working in time r on the class T requires advice of size Q{n/D). 


Proof. Let h = and let k = 


D-l 


. Let m = {D — l)k + 2 G 0(n). We define a class of trees T 

with size m G 0(n), even diameter D, and ^(T) < ^ such that the minimum size of advice needed 
by an arbitrary algorithm ELECT solving leader election in time r for this class is Ll{n/D). 


We start with a single tree G of size m, defined as follows. Consider a single edge {co,ci}, and 
let the port numbers corresponding to this edge at cq and ci be both 0. Add a path Pi of length 
h with Co as one endpoint, and denote the other endpoint of this path by pi. Add a path Qi of 
length h + 1 with ci as one endpoint, and denote the other endpoint of this path by qi. Let the 
port sequences seq{co,pi) and seq(ci, qi) be (1,0 ,..., 1,0) (where the length of the latter sequence 
is two greater than the former.) 


Next, for each i G {2,..., k}, add a path Pi of length h — 2 with cq as one endpoint. The other 
endpoint of each of these paths will be denoted by pi. Further, let the port sequence seq{co,pi) 
is equal to (i, 0,1, 0,1, 0,..., 1,0). Add the same paths with ci as one endpoint. For each i G 
{2 ,... ,k}, we will refer to each of these paths, and their corresponding endpoint other than ci, 
as Qi and qi, respectively. The subtree of G described so far is denoted by H. Finally, for each 
i G {2,..., k}, add a tree To,* with root pi, where To,* is a path of length 2. Let the port sequence 
from each pi to the other endpoint of To,* be (1,0,1,0). Further, for each i G {2,..., k}, add a tree 
Ti^i of height 2 with qi as the root. More specifically, is a path of length 2 with an additional 
edge incident to the middle node. Let the port sequences from each qi to the leaves of Ti^i be 
(1,0,1,0) and (1,0, 2,0). This completes the definition of G. Figure 10 illustrates tree G. 


Next, for every subset cr of {2,..., k}, we define a tree G^- At a high level, G^ is obtained from 
G by swapping the subtrees rooted at pi and qi, for each i G <t. More specifically, the definition of 
Gfj is similar to the definition of G above, except that, for each i ^ a, tree To^i has qi as its root 
and tree Ti^i has pi as its root. Note that G 0 = G. Further, for any a / cr', we have G^ / Go-'- 
However, note that for any a, since the differences between G and Go are only at the leaves or 
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Figure 10: Tree G constructed in the proof of Theorem 5.3 


neighbours of leaves, we have that the subtree Ff of G is also a subtree of G^- The following result 
about H follows from: the symmetry of H, with Pi and Qi removed, with respect to the central 
edge, the fact that, for i G {2 ,..., k}, pi and qi are images of each other under this symmetry, and 
the fact that pi and Qi cannot deduce the lengths of Pi and Qi in time D — 6. 


Fact 5.2 For any i G {2,..., k}, the views VuiPi-, D — 6) and Vniqi, D — G) are identical. 

The class T is defined as the class of trees G^ for all subsets cr of {2 ,..., k}. We now set out to 
prove a lower bound on the number of different advice strings needed to solve leader election for 
all trees in T. 


Claim 5.9 For any a ^ a', the advice strings provided to algorithm ELEGT for trees G^ and G^' 
must be different. 


We prove the claim by contradiction. Assume that the advice strings assigned to G^ and G^i 
are the same. Since a a', without loss of generality, assume that there is an integer i € a 
such that i 0 a'. Consider a leaf v of tree Ti^j, which, in G^, is rooted at node pi. Note that. 


in Go-, Vg^{v,D — 4) = Vth{v,2) U VuiPi^D — 6). In G^', tree Ti^j is rooted at node qi, 


so 


Vq^, {v, D—A) = Vti i{v, 2)UVH{qi, P—6). By Fact 5.2, it follows that Vg^{v, P—4) = Vg^ {v, P—4). 
So, V outputs the same sequence of outgoing ports after executing ELEGT in both Go- and Go-'. 
Assume, without loss of generality, that in Go-, the node elected by v after executing ELEGT is 
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closer to cq than to ci. Since the length of u’s output is the same for both and Go-', the node 
elected by v after executing ELECT in G^j/ is closer to ci than to cq. However, in both G^ and 
Go', tree Top is rooted at node p*, and the leaf v' of tree Tq^i outputs the same sequence of outgoing 
ports. Hence, v' elects the same node in both Go and Ga->. Thus, in at least one of Go or Go', 
nodes v and v' do not elect the same node, a contradiction. This concludes the proof of the claim. 

Since there are different subsets of {2 ,..., fc}, the number of different advice strings is at least 
2^~^. It follows that the size of advice is Cl{k) = Q{n/D). 

Finally, for any a, we prove that ^(Go) < h + 1, which implies that ^{G^) < r. It is sufficient 
to show that, using h + 1 rounds of communication, an arbitrary node v given a map of Go can 
compute where it is located in the map. The distance from any node u to a node on the edge 
{co. Cl} is at most h + 1. Let c{v) be the endpoint of the edge {cq, ci} that is closest to v. It follows 
that V{v,h + 1) contains c{v) (which can be identihed by finding the closest node to v that has 
degree k + 1). Consider two cases. If u = c{v) or v is in the the subtree Ti of V (c(u), h + 1) induced 
by the nodes that can be reached from v via a path starting with port 1 , then Ti G {Pi, Qi} can 
be used to identify v’s position on a map of Go-. Indeed, v need only check the length of Ti, since 
iTil = h if and only if Ti = Pi. In the second case, let i be the final port number in the port 
sequence seq{v, c{v)). The subtree of V (c(u), h + 1) induced by the nodes that can be reached from 
c{v) via a path starting with port 1 has To,* or Ti^j rooted at the node at distance h — 2 from c{v). 
By identifying which of these two trees appears, v can identify its position on a map of Go-. □ 


6 Time a ■ diam for constant ct < 1/2 

In this section, we prove tight upper and lower bounds of 0(n) on the minimum size of advice 
sufficient to perform leader election in time aD for constant a < 1/2 in n-node trees T with 
diameter D G a;(log^ n) and ^(T) < aD. The upper bound, which holds for all values of D, is given 
by the following result. 

Proposition 6.1 Leader election in every non symmetric n-node tree T is possible in time C{T), 
using 0{n) bits of advice. 

Proof. We use the following observation of Chierichetti [9]. An n-node anonymous tree can be 
coded by an ordered pair of two sequences ( 0 , ■ 0 ) in such a way that trees that are not port¬ 
preserving isomorphic get different codes. Starting from any node considered as a root, perform 
a DFS traversal of the tree, visiting children of any node in the increasing order of ports at this 
node. The binary sequence f has length 2(n — 1) and is constructed as follows. Whenever an edge 
is traversed down the tree, write 0, and whenever it is traversed up the tree, write 1. The sequence 
^|J has length n — 1 ; let (ui,... ,Vn-i) be the sequence of nodes other than the root in the order 
of first visit in the traversal, and let the term of ip be the entry port number at the first visit 
of Vi. There are at most possible sequences (p. The number of possible sequences ip for 

each (p is bounded above by the product of degrees of all nodes other than the root and the sum of 
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these degrees is at most 2(n — 1). Hence the number of possible sequences ^|J for each cj) is bounded 
above by 2”“^. Hence, there are at most • 2"'“^ possible codes. The code of a tree is the 

lexicographically smallest pair (cj), over all choices of the root. 

We solve leader election as follows. The advice is the code of the tree. It has size 0{n). Using 
this code, all nodes construct a faithful map of the tree. Using the map, they can perform leader 
election in time ^(T), by the definition of this parameter. □ 


The next result is a matching lower bound when the diameter is not too small compared to n. 


Theorem 6.1 Let D < n be positive integers, such that D G uj{log^ n). Let a < 1/2 be a constant 
and let r = [aD\. There exist trees T with size 0(n), diameter D, and ^{T) < t, for which any 
leader election algorithm working in time r requires advice of size H(n). 

Proof. We define a class T of trees T with size 0(n), diameter D, and f(T) < r such that the 
minimum size of advice needed by an arbitrary algorithm ELECT solving leader election in time 
T for this class is H(n). 

We consider the case where D is even (the case where D is odd is obtained by adding a single 
edge to the construction for diameter D — 1). We begin by constructing a “template” tree G from 
which all trees in our class 7~ will be constructed. Tree G itself is not a valid instance for leader 
election since some of its port numbers are undefined, but each tree in our class is obtained from G 
by filling in the missing port numbers. Our construction of G depends on a set Ai of trees that we 
call markers. Each marker is a tree of height 2 with hxed port numbers, and each marker appears 
in G only once. The purpose of the markers is to guarantee f,{T) < r by enabling each node to 
determine its location in a map of G. Later, we will specify how to define the markers so that we 
have as many of them as we need. 

Let k = \2n/D'\. Our template G consists of a central node c, and, for each i G {0,..., A: — 1}, 
a path Pi of length D/2 with one endpoint equal to c. For each i G {0, ... ,k — 1}, the other 
endpoint of Pi will be denoted by pi, and the hrst port number of the sequence seq{c,pi) is equal 
to i. Let / be the integer in {r + l,r + 2} that has the same parity as D/2. The integer / is the 
number of nodes on each path Pi (without c) whose incident port numbers are hxed in G. More 
specihcally, the hrst 2/ — 1 port numbers of the sequence seq{pi, c) are (0,1, 0,1,..., 0). Finally, 
for each i G {0, ... ,k — 1}, we place one marker rooted at distance 2 from pi, and, from this node, 
we place one marker rooted at every (r — 4)*^ node on the path towards c. We ensure that no two 
markers from Ai are used twice in the construction. See Figure [TT] for an illustration of G. 

We are now ready to construct the class T. Let s = ^ — f. Hence s G Q{D). We note that, on 
each path Pi in G, there are s consecutive nodes (starting at c’s neighbour on Pi) whose incident 
port numbers are not dehned. Call these nodes Ujq,..., Vi^s- Since / was chosen to have the same 
parity as it follows that s is even. For j G {1,... ,s/2}, let Cij be the edge {vi^ 2 j-i,Vi^ 2 j}- We 
say that edge eij is set to 0 (respectively, set to f) if the two ports corresponding to edge eij are 
equal to 0 (respectively, equal to 1), and the ports at Vi^ 2 j-i and Vi^ 2 j not corresponding to edge eij 
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Figure 11: Template G constructed in the proof of Theorem 6.1 
markers from M. 


where Ma, Mb, Me are different 


are equal to 1 (respectively, equal to 0). We now demonstrate how to obtain a tree by defining 
a labeling function L. In particular, a labeling function L : {0 ,... ,k — 1} x { 1 ,..., s/ 2 } ^{ 0 , 1 } 
maps pairs of integers {i,j) to 0 or 1 . The tree Gl is obtained from G by setting each edge ejj 
to L{i,j). Note that, for any labeling function L, the port labeling of tree Gl is valid. Further, 
for any distinct labeling functions L,L', the trees Gl and Gl' are distinct. The class T is defined 
as the class of trees Gl for all labeling functions L. The following result will help us compute the 
number of different advice strings required by algorithm ELECT for the class T. 


Claim 6.1 For any distinct labeling functions L, L', the advice strings provided to algorithm ELECT 
for trees Gl and Gy must he different. 

We prove the claim by contradiction. Assume that the advice strings assigned to and Gy are 
the same. Since L and L' are distinct, there exists an i G {0,..., A; — 1} and a j G {1, ..., s/2} such 
that L{i,j) / L'{i,j). Without loss of generality, assume that L{i,j) = 0. Choose an arbitrary 
i' G {0, ... ,k — 1} — {i}. We consider the executions of algorithm ELECT by nodes pi and p^ in 
both Gl and Gy- First, note that the first 2 / — 1 > 2 t + 1 port numbers of seq{pi,c) and seq(pi’,c) 
were fixed in G. It follows that V{pi,T) is the same in both Gl and Gy- Similarly, V{pl,t) is 
the same in both Gl and Gy- Since Gl and Gy are assigned the same advice, it follows that pi 
outputs the same sequence of outgoing ports, say a, after executing ELECT in G^ as it does after 
executing ELECT in Gy- Similarly, pj/ outputs the same sequence of outgoing ports, say a', in 
both executions. Since the sequence outputted by pi is the same in both executions, corresponds 
to simple paths in both of them, and the port numbers at c are fixed, pi elects the same node in 
Gl and in G/. The same is true for pj/. Therefore, it must be the case that, for one of a or a', the 
corresponding path crosses edge Cij using the same outgoing port number in the execution in Gl 
as in the execution in Gy- However, the two ports corresponding to are labeled 0 in Gl and 
labeled 1 in Gl', a contradiction. This proves the claim. 

By Claim |6.1t the number of different advice strings is equal to the number of distinct labeling 
functions, i.e., But 5 = ^ — / >y — (t + 2)>^ — [aD\ — 2 G &{D), and k = \2n/D '], so 

the number of labeling functions is 2'^"' for some positive constant c. It follows that the minimum 
number of bits sufficient to encode the advice strings is Ll{n), as required. 
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Next, we show that, for each T £ T, we have ^{T) < r. By construction, for each node v in T, at 
least one of the following must be true: 

1. u is contained in a marker 

2. u is located between two consecutive roots of markers 

3. for some i G {0,k — 1}, v = pi, or v is located between the root of a marker and pi, with 
no root of a marker between v and pi 

4. V = c, or, V is located between the root of a marker and c, with no root of a marker between 
V and c 


In case (1), v is at distance at most 2 from the root of the marker. Since the markers are spaced 
distance (r — 4) apart, the distance from u to a neighbouring marker is at most r — 2. Therefore, 
V{v,t) contains the marker containing v as well as at least one neighbouring marker. In case (2), 
since the markers are spaced distance (r — 4) apart and the height of each marker is 2, it follows 
that V{v,t) contains the two markers closest to v. In both cases, using these two markers, v can 
determine where on a map of T it is located. In case (3), since we placed a marker rooted at 
distance at most 2 from each p*, it follows that V(v, r) contains the marker closest to v. Using this 
marker, v can determine where on a map of T it is located. In case (4), by the construction of G, the 
distance between v and c is at most r — 4, so c is in U(u, r). Node c is uniquely recognizable as the 
node of degree k, as long as we ensure that no root of a marker has degree k (see the specification 
of the markers below). 

Hence in all cases v can locate itself in the map within time r, so leader election can be done in 
this time given the map, thus proving ^(T) < r. 


It remains to describe the set of markers. First, observe that the total number of markers needed to 


define the template G is bounded above by k 


D 

2(r-4) 


This is because, for each i £ {0,k — 1}, 


path Pi in G contains § nodes (other than c), and a marker is placed every (r — 4) nodes. So the 


number of markers needed is bounded above by 4 (^) ( 2 (t^ 4) 


4n 

T—4 * 


Let y = [log;^^]. Consider the family X of all trees of height 2 with z = leaves and whose 
root has degree x = for sufficiently large n. For each of these trees, label the ports 

at the root node using {2,..., x + 1} (the port numbers 0 and I are reserved to label the ports 
on path Pi.) For each node at the first level, the port leading towards the root node is labeled 0. 
The number of trees in X is equal to the number of ordered partitions of the set of leaves into x 
parts (a partition specifies the number of leaves adjacent to each of the x nodes at the first level 
of the marker.) So the number of trees is equal to yyg now show that 

^ Note that log (^) = logz^ — logx! > xiogz — xlogx > \y^^‘^~\ logp^ — \y'^^‘^~\ logy^/^ = 

2|'y3/2i iQgy _ 7|"y3/2-| ^ ^ follows that ^ Therefore, by taking an arbitrary 

subset A4 of size at least of the family X, we have a sufficiently large set of markers to define 
G. Note that, since the size of each marker is x + z, the total number of nodes needed to define the 
markers in G is at most + 2/^) < :^(2/^) < log^ Since D £ w(log^ n), it follows 
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that r — 4 = \aD\ — 4 G a;(log^ Hence, for sufficiently large n, the number of nodes needed 

to dehne the markers is o(n). Thus, the size of G (and, therefore, of each tree in T) is in 0(n). □ 


Proposition |6 .1 1 and Theorem 6.1 imply the following corollary. 


Corollary 6.1 Let D < n be positive integers, such that D G a;(log^ n). Let a < 1/2 he a constant 
and let r = [aHj. Let T he the class of trees T with size 0(n), diameter D, and f{T) < r. The 
minimum size of advice to perform leader election in time r for the class T is 0(n). 


7 Discussion of open problems 

For time values diam — 1 and diam — 2, we gave tight bounds (up to constant factors) on the 
minimum size of advice sufficient to perform leader election for trees of diameter diam. For time 
in the interval [/3 ■ diam, diam — 3] for constant /3 > 1/2, we gave bounds leaving a logarithmic gap, 
except for the special case when diam is even and time is exactly diam — 3. This yields the first 
problem: 

PI. Find close upper and lower bounds on the minimum size of advice in the special case when 
diam is even and time is exactly diam — 3. 

As a step in this direction, we prove the following lower bound which implies an exponential jump 
in the minimum size of advice between time diam — 2 and time diam — 3 when diam is even and 
constant. 

Proposition 7.1 Let n be a positive integer and let D he an even positive integer constant such 
that D > 6. There exists a class T of trees T with size 0(n), diameter D, and f{T) < D — 3, such 
that every leader election algorithm working in time D — 3 on the class T requires advice of size 
logn). 

Proof. Let h = D/2 and let A = [(yn)^/^], where 7 is a sufficiently large constant greater than 
1 so that A > 2. We start by dehning a tree G from which we will derive the class T. At a high 
level, trees in T will be defined in such a way that, for some leaves v, there are many nodes in the 
view V{v,D — 3) such that v cannot be sure which of them is the central node. 

To this end, in the construction of G, we will use special subtrees, called confusion subtrees, as 
building blocks. For any i G {0,..., A — 1}, a confusion subtree avoiding port i, denoted by Ti{x), 
is defined recursively with its height x as parameter. For the base cases, let Tj(0) consist of a single 
node with degree 0, and let Ti{l) consist of a node Wi with A degree-1 neighbours, such that the 
ports at Wi are labeled by integers from {0,..., A} \ {z}. For any x > 1, define Ti(x) as follows: 

1. let Wi be the root node with degree A 
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2. Label one of Wi’s neighbours as ca- Attach a path of length x — 2 with ca as one endpoint, 
and let pA be the other endpoint of this path (if x = 2, set pA = ca-) Set the port sequence 
seq{wi,pA) equal to (A, 0,1,0,1,, 0,1, 0). 

3. for each j £ {0,..., A — 1} \ {i}, label one neighbour of Wi as Cj. Set the two ports corre¬ 
sponding to edge {wi, Cj} equal to j. For each A;G{0,...,A — add a neighbour bj^k 

to Cj. Set the two ports corresponding to edge {cj,bj^k} equal to k. At each 6 ^^, attach a 
copy of Tfc(x — 2) by identifying the root of Tk{x — 2) with bj^k- 


See Figure [T^ for an illustration of Ti{x). Finally, let G consist of a root node c of degree A, with 
the roots of To(/i — 1),..., TA-i{h — 1) as its neighbours. For each i G {0,..., A — 1}, we denote 
by Wi the root of Ti{h — 1), and we denote by qi the node pA defined at step in the construction 
of Ti{h — 1). For each fG{0,...,A — 1}, the two ports corresponding to the edge {c,Wi} are set 
to i, and, A -|- f + 1 leaves are added as neighbours of qi. See Figure [T^ for an illustration of G. 



Figure 12: Tree G constructed in Proposition |7.1[ with A = 3 and /i = 4. Confusion trees 
To(3), ro(l), ri(l), r 2 (l) can be found in the dashed boxes. 

We can now define the class T. For every permuation a of the integers {1,...,A — 1}, we obtain 
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a tree from G by applying a to the subtrees rooted at qi,, qa-i- More specifically, for each 
i G {1,..., A — 1}, the number of neighbouring leaves of qi is changed to A + a{i) + 1. It follows 
that, when a is the identity permutation, Ga = G. Further, note that the only differences between 
two trees G^ and G^i are the degrees of some nodes among qi,..., qA-i- The class T is defined as 
the set of trees G^ for all permutations a of the integers A — 1}. By counting the number 

of such permutations, it follows that \T\ = (A — 1)!. 

We now determine the number of different advice strings needed by any algorithm ELEGT that 
solves leader election in time D — 3 in trees from the class T. The following result shows that the 
number of different advice strings needed by algorithm ELEGT is |T|. 

Claim 7.1 For any two distinct permuations a, a' of the integers {1,..., A — 1}, the advice strings 
provided to algorithm ELEGT for trees G^ and G^i must be different. 

We prove the claim by contradiction. Assume that, for some permutations a a', the advice 
strings assigned to Go- and Go-' are the same. The proof proceeds by finding a leaf £o- in Go- and a 
leaf in Go-' such that the two leaves have the same view at distance H —3 in their respective trees. 
Since the two trees are assigned the same advice, this implies that the executions of ELEGT by 
these two leaves output the same port sequence to elect a leader in their respective trees. Further, 
we show that go outputs the same port sequence in the execution of ELEGT in both Go- and 
Go-'. Finally, we show that ELEGT fails in at least one of Go- or Go-', which gives the desired 
contradiction. 

Definition of and Let j G {1,..., A — 1} such that a{j) / cr'ij)- Let / G {1,..., A — 1} 
such that = cj(j). Let be the leaf adjacent to qj in Go- such that the port at qj corresponding 

to the edge {gj,^o-} is 1. Similarly, let be the leaf adjacent to qji in Go-' such that the port at 
qj! corresponding to the edge {gj',f'o-'} is 1. 

Showing that and output the same port sequence. Since G^ and G^j/ are assigned 
the same advice, it is sufficient to show that VG^{ia,T) — 3) = Vg^,(^ o-O-D — 3). To prove this 
fact, note that, since a{j) = cr'(j'), it follows that t'o-’s neighbour qj in Go- has the same degree as 
t'o-'’s neighbour qj> in Go-' (i.e., both of these degrees are equal to A + a{j) + 2.) It follows that 
Vg^{(-<tA) = (^o-G !)• Next, by the constructions of Tj{h — 1) and Tjfh — 1), we have that 

seq{qj,Wj) = (0,1,0,1,..., 0, A) in Go- and that seq{qji,Wji) = (0,1, 0,1,..., 0, A) in Go-'. Since 

\path{qj,Wj)\ = h — 2 = \path{qjGWji)\, we have so far shown that VG^{ia,h — l) = Vg^, (^o-G ^ ~ !)• 
Next, the confusion trees were constructed specifically to satisfy the following property: for any 
port sequence of length 2/i — 3 starting at Wj in Go- that does not begin with port A, the same 
sequence appears in Gcr' starting at Wj/. It follows that VG^{wj, h — 2) = VG^,{wji, h — 2). Finally, 
since wj is the only node at distance h — 1 from ia- in Go-, we have Vg^(^o-, h — 1) U VG^{wj, h — 
2) = VG^{ia,D — 3). Similarly, since Wj> is the only node at distance h — I from in Go-', 

we have Vg — 1) U Vg liwjGh — 2) = Vg — 3). Therefore, we have shown that 

Vg^{L, D -3) = Vg^, (4g T> - 3), as desired. 

Showing that go outputs the same port sequence in the execution of ELEGT in both G^ 
and Go-'. Since Go- and Go-' are assigned the same advice, it is sufficient to show that VG^(go, D — 
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3) = Vg^, {qo, D — 3). To prove this fact, note that, since d{qj, c) = h — 1 for all j G {0,..., A — 1}, 
it follows that, for all i G {1,..., A — 1}, d{qo, qi) = 2h — 2 = D — 2. In particular, this means that 
qi,..., qA-i are neither contained in Vc„{qo, D — 3) nor contained in Vg^, {qo, D — 3). Since the 
only differences between and Go-' are the degrees of some nodes among qi,, qA-i, it follows 
that {qo, D — 3) and Vg^, {qo,D — 3) must be equal. 

Showing that ELECT fails in at least one of G^ or Go-'- To obtain a contradiction, we assume 
that ELECT correctly elects a leader in both G^ and Go-'. First, suppose that elects a node in 
Tjih — 1) in Ga, and elects a node in Ty(h — 1) in Go-'- It follows that, in G^, node qo elects 
a node in Tj{h — 1) by outputting some sequence u whose term is j. Similarly, in Go-', node 
qo elects a node in Tji{h — 1) by outputting some sequence u' whose term is / 7^ j. However, 
this means that u ^ u', which contradicts the fact that qo outputs the same port sequence in the 
execution of ELECT in both Go- and Go-'- So, we have shown that it is not the case that both 
ifj elects a node in Tj{h — 1) in Go- and elects a node in Tji{h — 1) in G^^. So, without loss of 
generality, we may assume that, in Go-, node ia- elects a node that is not in Tj{h — 1). In particular, 
this means that the term in the sequence s outputted by rnust correspond to edge {wj,c} 
(since, otherwise, the path corresponding to this sequence would not contain c). Hence, the 
term in sequence s is equal to j. We showed above that in Ga and -£0' in Go' output the same 
port sequence, so the term of £o'’s output is also equal to j. Since j / j', it follows that the 
j^th g^gg |;}jg path corresponding to output does not correspond to edge {wji,c}. Hence, 
in Go', node elects a node in Tji{h — 1). This elected node is at distance |s| — {h — 1) from Wj/ 
since the first h — 1 ports in s correspond to the path from £0' to Wji. Since d{qo, Wji) = h, we have 
that, in Go', the length of go’s output is h + [|s| — {h — 1)] = |s| + 1. On the other hand, in Go, 
the node elected by is not m. Tj{h — 1), which means that the elected node is at distance \s\ — h 
from c (since the first h ports in s correspond to the path from £0 to c.) It follows that, in Go, 
the length of go’s output is at most h — 1 + [|s| — h] = |s| — 1, i.e., shorter than its output in Go'- 
This contradicts the fact that go outputs the same sequence in the execution of ELECT in both 
Go and Go'- Therefore, our assumption that ELECT correctly outputs a leader in both Go and 
Go' was false, so ELECT fails on at least one tree in T. 


This contradicts the correctness of ELECT, so our assumption that the same advice is provided 
for Go and Go' must be wrong. This concludes the proof of the claim. 


By Claim 7.1 there are \T\ = (A — 1)! different advice strings, and hence the size of advice is at 
least log((A — 1)!) G n(Alog A). Since A = [(7^)^^"^] , we get a lower bound Ll{'n?'/^ logn) on the 
size of advice. 


The following results demonstrate that leader election is solvable in each Go in time D — 3 (given 
its map), and that the number of nodes in Go is in 0(n). 


Claim 7.2 ^(Go) <D-3 


To prove the claim, we show that every node can identify itself in a map of Go, and thus can 
elect the central node c. First note that c is at distance at most h from every node in Go, so 
Vg^{c,D — 3) = VG„{c,h) is equal to Go. Therefore, in time D — 3, node c can identify itself in 
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a map of G^- For any v ^ c in G^, node v is located in TiQi — 1) for some i G {0,..., A — 1}. 
Since Ti{h — 1) has height h—1, the distance between any two nodes in Ti{h — 1) is at most D — 2. 
It follows that Vg„{v,D — 3) contains every node in Tj(/i — 1) except possibly some leaves. In 
particular, Vg^{v, D — 3) contains the path between nodes Wi and g*. This path is the only induced 
subtree of Ti{h — I) consisting of a path of length h — 2. It follows that v can identify this path 
in Vg„{v,D — 3). Hence, it can identify its endpoints and it sees their degrees. Therefore v can 
identify qi, as it is the only node in G^ of degree A + a{i) + 2. It follows that v can identify itself 
in a map of G^- This concludes the proof of the claim. 


Claim 7.3 The number of nodes in Ga is in 0(n). 


To prove the claim, note that the size of G^ is equal to 1 + (3A^ + A)/2 + A • \Ti{h — 1)|. We first 
prove that \Ti{x)\ < 3A® by induction on x. In the base cases, |Tj(0)| = 1 and |Tj(l)| = A + 1, 
so \Ti{x)\ < 3A* in these two cases. Now, suppose that x > 2, and that, for all y < x, we have 
\Ti{y)\ < 3A*'. From the construction of Ti{x), we observe that the size of Ti{x) is 1 + A + (x — 2) + 
(A —l)^-|rj(x —2)|. By the induction hypothesis, this is at most l + A + (x —2) + (A —1)^(3A'^“^) = 
1 + A + (x — 2 ) + [3A* — 6 A*“^ + 3A^“^] < l + A + (x — 2 ) + 3A^ — 3A^“^. Since each of the first 
three terms is bounded above by A*“^, we have shown that Tj(x) has size at most 3A®. Therefore, 
the size of G^ is at most 1 + (3A^ + A )/2 + 3A^ = 6 (( 7 n)^/^)^ < Gyn. This completes the proof 
of the claim. 


□ 


For time a ■ diam for any constant a < 1/2, we showed bounds with tight order of magnitude of 
0(n), except for diameter diam G O(log^n). This yields three open questions. 

P2. Find close upper and lower bounds on the minimum size of advice for time very close to 
diam/2, i.e., diam/2 ± o{diam). 

P3. Find close upper and lower bounds on the minimum size of advice for time below half of the 
diameter when the diameter is in 0 (log^ n). 

P4. Find close upper and lower bounds on the minimum size of advice when time is very small, 
e.g., logarithmic in n. 

This last problem has an intriguing twist. At first glance it would seem that the answer to it, at 
least for diameter in a;(log^n), is 0(n). Indeed, the upper bound 0{n) on the size of advice holds 
in this case as well (with the same proof as in Section 6 ), and the lower bound H(n) proved for 
time a ■ diam for any constant a < 1/2 should be “even more true”; since decreasing the allocated 
time makes the task more difficult, the required amount of advice should not decrease. Perhaps 
surprisingly, this argument overlooks the following subtlety. We should recall that, for a given time 
r, we seek solutions of our minimum advice problem only for trees T with f,(T) < r because, for 
other trees, leader election is infeasible in time r with any amount of advice. However, for small 
values of r, the restriction (,{T) < r could sometimes leave so few trees under consideration that 
more efficient advice than for larger values of r is sufficient. Is this really the case? 
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